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(54) Title: DNA SEQUENCING BY MASS SPEClROMEntY 
(ST) Abstract 

. . . describes a new method to sequence DNA TTie Improvements over the existing DNA sequencing technologies are 

High speed, high throughput, no electrophoresis and gel reading aitilewts due to the complete absence of an eteetrophoretk step, and no 
costly reagents involving various substituUons with stable isotopes. The invention utilizes the Sanger sequencing stiaiegy ai^assemblcs i 
the sequence mfomiation by analysis of the nested tiagments obtained by base-specific chain tennlnation via tbdr dMtont molecular 
masscsusmg mass A ftathcr taoease in ihniu^ut can be obtamed 

by introducing mass-modifications In the oUgonncleotlde primer, dtain-teiminating nucleoside tripbosfriiates and^or in the chain-etongathis 
ilir^Slr "f "^K^ integrated tMg sequences which allow mulUplexing by hybiidizatlon of tag specific probes with 

mass Qinerentisicy moieculer wei^ts. 
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DNA SEQUENCING BY MASS SPECTROMETRY 

Related Applications 

This application is a contmuation-in-part of U.S. Application Serial Number 
08/617,010, which is a continuation-in-part of U.S. Application Serial Number 
08/178,216, which issued as U.S. Patent No. 5,547,835, and whidi itself is a 
continuation-in-part of U.S. Application Serial Number 08/001,323 filed Januaiy 7, 1993, 
which is now abandoned. The contents of all related applications are incorporated herdn 
by reference. 

Backeround of thg Invcntifin 

Since the genetic information is rq>resented by the sequence of the four 
DNA building blocks depxyadenosine- (dpA), deoxyguanoane- (dpG), deoxycytidine- 
(dpC) and deoxythymidine-5*-phosphate {dpT). DNA sequendng is one of the most 
fundamental tedmologies in molecular Uolo^ and the Ufe sciences in general. The case 
and the rate by which DNA sequences can be obtained greatly afiEects related 
technologies such as development and production of new therapeutic agents and new and 
useful varieties of plants and microorganisms via recombinant DNA technology. In 
particular, unravding the DNA sequence he^s in undmianding human pathological 
conditions including gmetic disorders, cancer and AIDS. In some cases, very subtle 
differences such as a one nucleotide ddetion, addition or substitution can create serious, 
in some cases even fital, consequences. Recently, DNA sequencing has become the core 
technology of the Human Genome Sequencing Project (e.g., J.E. Bishop and M. 
Waldholz, 1991, Genome: The Storv of the Most Astnnishinp f^c^ mtifia Arivi^ttm. nf 
Our Hmg - The Attempt to Mao All the Genes in the H n man RnHy^ Simon & Schuster, 
New York). Knowledge of the complete human genome DNA sequence will certainly 
help to understand, to diagnose, to prevent and to treat human diseases. To be able to 
tackle successfully the determination of the approximately 3 btlUon base pairs of the 
human genome in a reasonable time frame and in an economical way, rapid, reliable. 
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mediated by DNA polymerase (Labeit el al., DMA L 173-177 (1986); Amersham, PCT- 
AppUcation GB86/00349; Eckstein et al. Nucleic Adds Rp,.; J[^, 9947 (1988)). Here, 
the four sets ofbase-specific sequencing ladders are obtained by limited digestion with 
exomiclease in or snake venom phosphodiesterase, subsequent separation on PAGE and 
visualization by radioisotopic labeling ofdther the prima- or one of the dNTPs. In a 
fiirther modification, the base-specific cleav^e is achieved by alkylating the sulphur atom 
in the modified phoq>hodiester bond followed by a heat treatment (Max-Planck- 
- GeseBschaft, DE 39303 12 Al). Both methods can be combined with the amplification of 
the DNA via the Polymerase Chain Reaction (PGR). 

On the upfixmt en^ the DNA to be sequenced has to be fiagmented mto 
sequencable pieces of currently not more than 500 to 1000 nucleotides. Starting fi^om a 
genome, this is a multi-rstep process involving cloning and subcloning steps using different 
and appropriate cloning vectors such as YAC. cosmids, plasmids and Ml 3 vectors 
(Sambrook et al. Molecular Cloning A l ahft ratnrv Manual Cold Spring Harbor 
Laboratory Press, 1989). Finally, for Sanger sequencing, the fragments of about 500 to 
1000 base pairs are integrated into a specific restriction site of the replicative form I (RF 
1) of a derivative of the M13 bacteriophage (Vieria and Messing. fifin6 l9, 259 (1982)) 
and then the double-stranded form is transformed to the single-stranded circular form to 
serve as a template for the Sanger sequencing process having a binding site for a 
universal primer obtained by chemical DNA synthesis (Sinha, Biemat, McManus and 
Kester, NMCleic Acids Rgs 12, 4539-57 (1984); U.S. Patent No. 4725677 upstream of 
the restriction site into which the unknown DNA fi-agmerit has been inserted. Under 
specific condhions, unknown DNA sequences integrated into superc(Mled double- 
stranded plaanid DNA can be sequenced directly by the Sanger method (Chen and 
Seeburg. I2MA4. 165-170 (1985)) and Urn «/a£. G««A«.I T^h.^ ^ 32-39 (1988). 
and. with the Polymerase Chain Reaction (PCR) fPCR PmtomU- a fi.»rfi> Mlhft flff 
ai»d Applications. Innis etal., editors, Acadnnic Press. San Di^o (1990)) cloning or 
subcloning steps could be omitted by directly sequencing off chromosomal DNA by first 
amplifying the DNA segment by PCR and then applying the Sanger sequencing method 
(Innis et al, Froc. Watl. Acad. Sci. USA S5, 9436-9440 ( 1 988)). In this case, however, 
the DNA sequence in the interested region most be known at least to the extent to bind a 
sequencing primer. 
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In order t be able to read the sequence from PAGE, detectable labels have 

to be used in either the primer (very ften at the 5 -end) or in one of the deoxynucleoside 

32 33 35 

triphosphates, dNTP. Using radioisotopes such as P, P, or S is still^he most 
frequently used technique. After PAGE^ the gels are exposed to X-ray jfilms and silver 
S grain exposure is analyzed. The use of radioisotopic labeling creates several problems. 
Most labels useful for autoradiographic detection of sequencing fragdnents have 

relatively short half-lives which can limit the useful time of the labels. The emission high 

. . 32 

energy beta radiation, particularly from P» can lead to breakdown ofthe products via 

radiolysis so that the sample should be used very quickly after labding. In addition, high 

10 energy radiation can also cause a deterioration of band sharpness by scattering. Some of 

33 35 

these problems can be reduced by using the less energetic isotopes sudi as P or S 
(see, e.g., Omstdn et al, Biotechhigues 2, 476 (1985)). Here, however, longer exposure 
times have to be tolerated. Above all, the use bf radioiisotopes poses significant health 
rides to the experimentalist and, in heavy sequencing projects, decontamination and 

15 handling the radidactive waste are other severe problems and burdens. 

In response to the above mentioned problems related to the use of 
radioactive labels, non-radioactive labeling techniques have been explored and, in recent 
years, integrated into partly automated DNA sequencing procedures. All these 
improvements utilize the Sanger sequencing strategy. The fluorescent label can be tagged 

20 to the primer (Smith et al . 221, 674-679 ( 1 986) and EPO Patent No. 

87300998.9; Du Pont De Nemours EPO Application No. 0159225, Ansorge et al. JL 
Biochmi Biophys Methods 12, 325-32 (1986)) or to tiie chain-terminating 
dideoxyraicloside triphosphates (Prober et al Spience 238 , 336-41 (1987); Applied 
Biosystems, PCT Application WO 91/05060). Based on dthor labeling the primer or the 

25 ddNTP» systems have been developed by Applied Biosystems (Smith et aL^ Science 235, 
G89 (1987); U.S. Patent Nos. 570973 and 689013), Du Pont DcNwnours (Prober a/. 
Science 238, 336,341 (1987); U.S. Patents Nos. 881372 and 57566), Pharmacia-LKB 
(Ansorge et al. Nucleic Adds Res li, 4593-4602 (1 987) and EMBL Patent Application 
DE P3724442 and P3805808. 1) and Hitachi (IP 1-90844 and DE 401 1991 Al), A 

30 somewhat similar approach was developed by Brumbaugh et al. fProc. Natl. Sd. USA 
£^ 5610-14 (1988) and U.S. Patent No. 4,729,947). An improved method for the Du 
Pont system using two dectrophoretic lanes \^th two different spedfic labels per lane is 
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described (PCT Application WO92/02635). A different approach uses fluoresccntly 
labeled avidin and bi tin labeled prime's. Here, the sequencing ladders ending with biotin 
are reaaed during dectrophor^is with the labeled avidin which results in-the detection of 
the individual sequencing bands (Bnimbaugh et al, U.S. Patent No. 594676). 

More recently even more sensitive non-radioacth^e bbeling techniques for 
DNA using chemiluminescence triggerable and amplifyable by enzymes have hem 
developed (Becfc, OKcefe, CouU and Koster, Nucldc Adds Res 17 51 15-5123 (1989) 
' aiid Beck and Kdster, Anal Chem S2. 2258-2270 (1990)). These labeling methods were 
combined with multiplex DNA sequencing (Church ei aL Science 240. 185-188 (1988) to 
provide fora strategy auned at high throughput DNA sequncing (Kdster ei aL, 
Nucleic Acids Res. Symposium Ser No 24 318^321 (1991), University of Utah, PCT 
Application No. WO 90/15883); this strategy still suffers from the disadvantage of being 
very laborious and dtfficuh to automate. 

In an attempt to simplify DNA sequencing, solid supports have been 
introduced. In most cases published so far, the template strand for sequencing (with or 
without PGR amplification) is immobilized on a solid support most frequently utilizing 
the strong biotin-avidin/strcptavidin interaction (Orion- Yhtyma Oy, U.S. Patent No. 
277643; M. IMenetal. Nucleic Add<; Re^ 16 in:>s,i« (lOftR); r^m,. ninf^^r>\\r prj 
Application No. WO 89/09282 and Medical Research Council, GB, PCT Apph'cation No. 
WO 92/03575). The primer extension products synthesized on the immobilized template 
strand are purified of enzymes, other sequ^ing reagents and by-products by a washing 
step and then released.under denaturiing conditions by loosing the l^drogen bonds 
between the Watson-Crick base pairs and subjected to PAGE sq)aration. In a different 
approach, the primer extension products (not the template) from a DNA sequencing 
reaction are bound to a solid support via biotin/avidin (Du Pont Dc Nwnours, PCT 
Application WO 91/1 1533). In contrast to the above mentioned methods, here, the 
interaction between biotin and avidin is overcome by employing denaturing conditions 
(formamide/EDTA) to release the primer extension products of the sequencing reaction 
from the solid support for PAGE separation. As solid supports, beads, (e.g., magnetic 
beads (Dynabeads) and Sepharose beads), filters, capillaries, plastic dipsticks (e.g., 
polystyrene strips) and microtiter wells are being proposed. 
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All methods discussed so far have one central step in common: 
polyacrylamide gel electrophoresis (PAGE). In many instances, this represents a major 
drawback and limitation for each of these methods. Preparing a homogeneous gel by 
]X)}ymerizati6n, loading of the samples^ the electrophoresis itseli^ detection of the 
S seqiimce pattern (e.g.» by autoradiography), removing the gel and cleaning the glass 
plates to prepare another gel are very laborious and time-consuming procedures. 
Moreover, the whole process is error-pnme, difficult to automate, and, in order to 
improve rq)roducibility and reliability, highly trained and skilled personnel are required. 
In the case of radioactive labeling, autoradiography itsdf can consume from hoars to 

10 days^. In the case of fluorescent labding, at least the dietection of the sequendng bands is 
bdhg petformed amdmatically when u^ng the laser-scanning devices integrated into 
commercial available DNA sequencers. One problem related to the fluorescent labeling is 
the influence of the four different base-spedfic fluorescent tags on the mobility of the 
fragments during dectrophore^s and a possible overlap in the spectral bandwidth of the 

1 5 four specific dyes reducing the discriminating power between neighboring bands, hence, 
increasing the probability of sequence ambiguities. Artifacts are also produced by base- 
specific interactions with the polyacoiamide gel matrix (Frank and Kdster, Nucteig 
Acids Res: 6, 2069 ( 1 979)) and by the formation of secondary structures which result in 
"band compressions" and hence do not allow one to read the sequence. This problem 

20 has, in part, been overcome by using 7-deazadeoxyguanosine triphosphates (Barr et al^ 
Biotechniques 4. 428 (1986)). However, the reasons for some artifacts and conspicuous 
bands are still under investigation and need further improvement of the gel 
dectrophoretic procedure. 

A recent innovation in dectrojphoresis is capillary zohe el^^^ 

25 (CZE) (Jorgcnsbn et aL, J Chromatogniphv 252, 337 (1986); Gestdaiid et al. 

Nucleic Adds Resi IS, 1415-1419 (1990)) whidi, compared to slab gd electrophoresis 
(PAGE)i significantly increases the resolution of the separation, reduces the time for an 
dectrophoretic run and diows the andyds of very smdl samples. Here, however, other 
problems arise due to the miniaturizatton of the whole system such as wall effects and the 

30 necessity of highly sensitive on-line detection methods. Compared to PAGE, another 

drawbadc is created by the &ct that CZE is only a **one-lane" process, whereas in PAGE 
samples in multiple lanes can be electrophoresed simultaneously. 
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Due to the severe limitations and probi^ns related to having PAGE as an 
integral and central part in the standard DNA sequencing protocol, several methods have 
been proposed to do DNA sequencing without an electrophoretic step. One approach 
calls for hybridization or fragmentation sequencing (Bams, BiotechnQlogy ig, 757-58 
(1992) and Mirzabekov et aI..¥EES Letters ^S6 1 18-122 (1989)) utilizing the specific 
hybridization of known short oligonucleotides (e g., octadeoxynudeotides which gives 
: :65;536 different sequences) to a complementaiy DNA sequence. Positive hybridization 
reveals a short stretch of the unknown sequence. Rq>eatjng this process by performing 
hybridizations with all possible octadeoxynudeotides should theoretically detmnine the 
sequence. In a completely dtfTermt approach, rapid sequencing of DNA is done by 
unilaterally degrading one single, immobilized DNA fragment by an wconudease in a 
moving flow stream and detecting the deaved nucleotides by their spedfic fluorescent tag 
via laser exdtation (Jett et aL, J. Biomolecular Stmchire^ Pyf?^TTrS 2, 301-309, 
(1989); United States Department of Energy, PCT AppUcation No. WO 89/03432). In 
another system proposed by Hyman (Anal Biochem 174 ^ 423^36 (1988)), the 
pyrophosphate generated when Ae correct nudcotidc is attached to the growing chain on 
a primer-template system is used to determine the DNA sequence. The enzymes used 
and the DNA are held in place by solid phases (DEAE-Sepharose and Sepharose) either 
by ionic interactions or by covalent attachment. In a continuous flow-through system, the 
amount of pyrophosphate is determined via bioluminescence (luciferase). A synthesis 
approach to DNA sequencing is also used by Tsien et ai (PCT Application No. WO 
91/06678), Here, the incoming dNTPs are protected at the 3*-end by various blocking 
groups such as acetyl or phosphate groups and are removed before the next dongation 
step, which makes this process very slow compared to standard sequencing methods. 
The template DNA is inmiobilized on a polymer support. To detect incorporation, a 
fluorescent or radioactive label is additionally incorporated into the modified dNTP's. 
The same patent appGcatton also describes an apparatus designed to automate the 
process. 

Mass spectrometry, in general, provides a means of '"weighing" individual 
molecules by ionizing the molecules in vacuo and making them "fly" by volatUization. 
Under the influence of combinations of electric and magnetic fields, the ions follow 
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trajectories depending on their individual mass (m) and charge (z). In the range of 
molecules with low molecular weight, mass spectrometry has long been^part of the 
routine phy^cal-organic rq>ertoire for analysis and characterization of organic molecules 
by the detenhination of the mass of the parent molecular ion. In addstton, by arranging 
collisions of tlus parent molecular ion with other partides (e.g.^ argon atoms), the 
molecular ton is fiiijgmented forming secondary ions by the so-called collision induced 
di^dation (CID). The fragmentation pattern/pathway very often allows the derivation 
of detiasled structural information. Many aipplicattons of mass spectrom^ric m^ods in 
the known in the art, particulariy in biosciences, and can be found sumrharized in 
MethodiB in Eniymologv. Vol. 193: TMass Spectrbiiietfy'* (J A. McCloskey, editor), 
1990, Academic Press. New York. 

Due to the apparent analytical advantages of mass spectrometry in 
providing high detection sensitivity, accuracy of mass measurements, detailed structural 
information by CID in conjiiiiction with an MS/MS configiuration arid speed, as well as 
on-line data transfer to a computer, there has been considerable interest in the use of 
mass spectrometry for the structural analysis of nucidc acids. Recent reviews 
summarizing this field include K. H. Schram, "Mass Spectrometry of Nucleic Acid 
Components, Biomedical Applications of Mass Spectrometry" 24, 203-287 (1990); and 
P.F. Grain, "Mass Spectrometric Techniques in Nucleic Acid Research/ MaS5 
Spectrometry Reviews 9. 505-554 (1990). The biggest hurdle to applying mass 
spectrometry to nucleic acids is the difficulty of volatilizing these very polar biopolymers. 
Therefore, "sequencing** has been limited to low molecular weight synthetic 
oligonucleotides by d^ermining the rnass of the parent molecular ion and through this, 
confirming the iab'eady known sequence, or alternatively, confirming the known sequence 
thh>tigh the gerieratiori of secdndaiy ions (fragment ions) via CID in an MS/MS 
configuration uttliang, in particular, for the ionization and volatilization, the m^hod of 
fiist at6riuc bombardment (FAB mass spectrometry) or plasma desorption (PD mass 
spectrometry). As an example, the application of FAB to the analysis of protected 
dimeric blocks for chemical synthesis of oligodeoxynucleotides has been described 
(KOster et al Rinmedicat Environmental Mass Spectrometry H> H 1-1 16 (1987)). 
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Two more recent ionization/desorption techniques are electrospray/ionspray 
(ES) and matrix-assisted laser desorption/ionization (MALDI). ES mass-spectronietry 
has been introduced by Fenn et aL f J Vhiiis. nh«n fig, 4451-59 (1984); PCT Application 
No. WO 90/14148) and current applications are sununaiized in recent review articles 
flLD. Smith efo/.- Anal rhcm 882-89 (1990) and B. Ardrey, Electrospray Mass 
Spectrometiy, Spectroscopy Eumpfi. H 0-l8 (1992)). The molecular weights of the 
"tetradepamideotidc d(CATGCCATGGCATG) (SEQ ID NO: 1 ) (Covey et aL "The 
Determination of Protdn, Oligonucleotide and Peptide Molecular Weights by lonspray 
Mass Spectrometry," Rapid Communicatinns in Mass Spectromfttty, 2, 249-256 (1988)). 
of the 21-mer d(AAATTGTCjCACATCCTGCAGC) (SEQ ID N0:2) and without giving 
details of that of a tRNA with 76 nucleotides rMethods in Rii7vmf.l<^pv 122, "Mass 
Spectrometry" (McCloskey, editor), p. 425, 1990, Academic Press, New York) have 
been published. As a mass analyzer, a quadrupole is most frequently used. The 
detennination of molecular weights in femtomole amounts of sample is very accurate due 
to the presence of multiple ion peaks which all could be used for the mass calculation. 

MALDI mass spectrometry, in contrast, can be particularly attractive when 
a time-of-flight (TOF) configuration is used as a mass analyzer. The MALDI-TOF mass 
spectrometry has been introduced by Hillenkamp et aL ("Matrix Asasted UV-Laser 
Desorption/Ipnizadon: A New A|)proad) to Mass Spectrometry of Large Biomolecules," 
Bioloeical Ma.« .Speftrmmfttiy (Burlifigame and McClodcey, editors). Eisner Science 
Publishera* Aimtefdam, pp. 49-60, 1990.) Since.;in most cases, no multiple molecular 
ion peaks are produced with this technique, the mass spectra, in principle, look ampler 
.compared to ES mass spectrometry. Although DNA molecules up to a molecular weight 
of 410,000 dahons could be desorbed and volatilized (Williams et aL, "Volatilization of 
Kgh Molecular Weight DNA by Pulsed Laser Abhttion of Frozen Aqueous Solutions." 
SsasnCfi. 24fi. 1585-87 (1989)), this technique has so far only been used to determine the 
molecular weights of relatively small oligonucleotides of known sequence, e.g.. 
oligothymidylic acids up to 1 8 nucleotides (Huth-Fehre et al., "Matrix-Assisted Laser 
Desorption Mass Spectrometry of Oligodeoxythymidylic Acids, " 
Rapid Communication.s in Mass Snectrnmft^fy, ^, 209-13 (1992)) and a double-stranded 
DNA of 28 base pairs (WiUiaras et al., "Time-of-F|ight Mass Spectrometiy of Nucleic 
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Acids by Laser Ablation and Ionization from a Fr zen Aqueous Matrix/' Rapid 

Commur^jcations in Mass Spectrometry, ^ 348-3S1 (1990)). In one publication (Hutb- 

Fehre et al^ 1992 , supra)^ it was shown that a nuxture of all the oligothynudylic adds 

from npl2 to n»18 nucleotides could be resolved. 

In U.S. Patent No. 5,064,754, RN A transcripts extended by DNA both of 

which are complementary to the DNA to be sequenced are prepared by incorporating 

NtPs, dNTP*s and, as terminating nucleotides, ddNTP*s which are substituted at the 5 - 

12 13 14 

position of the sugar moiety with one or a combination of the isotopes C, C, C, 
12 3 16 17 18 

H, H, H, O, Oand O. The polynucleotides obtained are degraded to 3- 
nucleottdes, deaved at the N-glycosidic linkage and the tsotopicaOy labded 5*- 
functionality removed periodate oxidadoii and the resulting formalddiyde spedes 
determined by mass spectrometry. A spedfic combination of isotopes serves to 
discriminate base-specifically between internal nudeotides originating from the 
incorporation of NTP*s and dNTP's and terminal nucleotides caused by linking ddNTP's 
to the end of the pdlynudeotide chain. A series of RNA/DNA fragments is produced, 
and in one embodiment, separated by electrophoresis, and, with the aid of the so-called 
matrix method of analysis, the sequence is deduced. 

In Japanese Patent No. 59-131909, an instrument is described which deteas 
nucleic acid fragments separated either by electrophoresis, liquid chromatography or high 
speed gd filtration. Mass spectrometric detection is achieved by incorporating into the 
micldc acids atoms which normally do not occur in DNA such as S, Br, 1 or Ag, Au, Pt, 
Os, Hg. The method, however, iis not s^iplied to sequencing of DNA using the Sanger 
method. In particular, it does not propose a base-spedfic dorrelation of such elements to 
an individual ddNTP. 

PCT Application No. WO 89/12694 (Brennan et al,, Proc SPIE-Int Soc 
Opt Eng 1206, rNew Technol Cvtom Mol Btol V pp. 60-77 (1990); and Brennan, 

U.S. Patent No. 5,003,059) employs the Sanger methodology for DNA sequencing by 

32 33 34 36 35 37 
using a combination of either the four stable isotopes S, S, S, Sbr CI, CI, 

79 81 

Br, Br to specifically label the diain-terminating ddNTP's. The sulfur isotopes can 
be located either in the base or at the alpha-position of the triphosphate moiety Whereas 
the halogen isotopes are located either at the base or at the 3 -position of the sugar ring. 
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The sequencing reaction mixtures are separated by an electrophoretic technique such as 
CZE, transferred to a combustion unit in which the sulfur isotopes of die incorporated 
ddNTP's are transformed at about 900^C in an oxygen atmosphere. The SO2 generated 
with masses of 64, 65, 66 or 68 is determined onr-line by mass spectrometry using, e.g., as 
mass analyzer, a quadrupoie with a single ion-multiplier to detect the ion current. 

A ^milar approach is proposed in U.S. Patent No. 5,002,868 (Jacob^^ 
■ gCPrOg. SPIE-lm. SOC. Opt Frllg 1435, YOm. Methods IJltra^^enririvi. Q^Prt Aw^^l 
Tech, AppI,), 26-35 (1991)) using Sanger sequendng with four ddNTP's specifically 
substituted at the alpha*poation of the triphosphate moiety with one of the four stable 
sulfur isotopes as described aboye and subsequent separation of the four sets of nested 
sisquences by tube gel electrophoresis. The only difference is the use of resonance 
ionization spearoscopy (RIS) in conjunction with a magnetic sector mass analyzer as 
disclosed in U.S. Patent No. 4,442,354 to detect the sulfur isotopes corresponding to the 
specific nucleotide terminators, and by this, allowing the assignment of the DNA 
sequence. 

EPO Patent Applications No. 0360676 Al and 0360677 Al also describe 
Sanger sequencing using stable isotope substitutions in the ddNTP's such as D ^^C 
"k 'V 'V «S, "S. "S, "s. "f. "c, "a. V ''Brand '"lortaoio- 
groups such as CF3 or 81(013)3 at the base, the sugar or the alpha position of the 
triphosphate moiety according to chemical functionality. The Sanger sequencing reaction 
nuxtures are s^arated by tube gd electrophoresis.. The efflum is converted into an 
aerosol by the dectrospray/themiospray nd>ulizer method and then atomized and ionized 
by a hot plasma (7000 to SQOO^K) and analyzed by a simple mass analyzer, An 
instnimem is proposed which enables one to automate the analysis of the Sanger 
sequencing reaction mixture consisting of tube electrophoresis, a nebulizer and a mass 
analyzer. 

The application of mass spectrometry to perform DNA sequencing by the 
hybridization/fragment method (see above) has been recently suggested (Bains, "DNA 
Sequencing by Mass Spectrometry: Outlin of a Potential Future Application," 
Chimicaoggi 9, 13-16(1991)). 
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Summary of the Invention 

The invention describes a new method to sequence DNA. ~ The 
improvements over the existing DNA sequencing technologies include high speed, high 
throughput, no required electrophoresis (and, thus, no ge! reading artifacts due to the 
5 complete absence of an electrophoretic step), and no costly reagents involving various 
substitutions with stable isotopes. The invention utilizes the Sanger sequencing strategy 
and assembles the sequence infbnnation by amdysis of the nested fiagments obtained by 
base-specific chain termination via their diff^^ molecular masses u^g mass 
spectrometry, for example, MAIDI or ES mass spectrometry. A further increase in 
10 throughput can be obtained by introdudhg mass modifications in the oligonucleotide 
primer, the chain^terminating nucleoside triphosphates and/or the chain-elongating 
nucleoside triphosphates, as well as using mtegrated tag sequences which allow 
multiplexing by hybridi2ation of tag specific probes with mass differentiated molecular 
weights. 

15 

BriefP^SCTiptionQf the FIGURES 

FIGURE 1 is ai representation of a process to generate the samples to be 
analyzed by mass spectrometry. This process entails insertion of a DNA fi'agment of 
unknown sequence into a cloning vector such as derivatives of M l 3, pUC or phagemids; 

20 transforming the double-stranded form into the single-stranded form; performing the four 
Sanger sequencing reactions; linking the base-specifically terminated nested fi^agment 
fiunily temporarily to a solid support; removing by a washing step all by-products; 
conditioning the nested DNA or RNA firagments by, for example, cation-ion exchange or 
modification reagent aiid presenting the immobilized nested firagments either directly to 

25 niass spectrometric analysis or cleaving the purified fragment family off the support and 
evaporating the cleavage reagent. 

FIGURE 2A shows the Sanger sequencing products using ddTTF as 
tenmnating deoxynucfeoside triphosphate of a hypothetical DNA fiagment of SO 
nucleotides (SEQ ID N0:3) in length with approximately equally balanced base 

30 composition. The molecular masses of the various chain terminated fragments are given. 
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FIGURE 2B shows an idealized mass spectrum of such a DNA fragment 
mixture. ... ^ . ^ 

HGURES 3 A and 3B show, in analogy to FIGURES 2A and 2B, data for 
the same model sequence (SEQ ID N0:3) with dd ATP as chain terminator. 

FIGURES 4A and 4B show data» analogous to FIGURES 2A and 2B 
when ddGTP is used as a chain terminator for the same model sequence (SEQ ID N0:3). 

FIGURES SA and SB ilhistrate the results obtained where chain 
termination is performed with ddCTP as a chain terminator, in a sunilar way as shown in 
FIGURES 2A and 2B for the same modd sequence (SEQ ID N0:3) 

FIGURE 6 summarizes the results of FIGURES 2 A to SB, shovwng the 
correlation of molecular weights of the nested four fi^gment families to the DNA 
sequence (SEQ ID N0:3). 

FIGURE 7 illustrates the general structure of mass-modified sequencing 
nucleic acid primers or tag sequencing probes for either Sanger DNA or Sanger RNA 
sequencing.. 

FIGURE 8 shows the general structure for the mass-modified 
triphosphates for either Sanger DNA or Sanger RNA sequencing. General formulas of 
the chain-elongating and the chain-terminating nucleoside triphosphates are 
demonstrated. 

FIGURE 9 outlines various linking cheniistries (X) with either 
polyethylene glycol or terminally mpnoall^ated polyethylene glycol (R) as an example. 

FIGURE 1 0 illu^uates similar linking dwmistrics as shown in FIGURE 8 
arid depicts various mass modifying moieties (R). 

FIGURE 1 1 outlines how nuihiplex mass spectrometric sequencing can 
work using the mass-modified nucleic acid primer (UP). 

FIGURE 12 shows the process of multiplex mass spectrometric 
sequencing employing mass-modified chain-elongating and/or terminating nucleoside 
triphosphates. 

FIGURE 13 shows multiplex mass spectrometric sequencing by involving 
the hybridization of mass-modified tag sequence specific probes. 
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FIGURE 14 shows a MALDMOF spectrum of a mixture of 
oligothymidylic acids, d(pT) j 2- 1 8* - 

FIGURE 15 shows a superposition of MALDI-TOF spectra of the 50-mer 
d(TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGC^ 
(SEQ ID N0:3) (500 finol) and dT(pdT)99 (500 fmol). 

FIGURE 16 shows the MALDI-TOF spectra of all 13 DNA isequences 
representing the nested dT-terminated fragments of the Sanger DNA sequendng 
siimulation of Figure 2, 500 finol each. 

FIGURE 17 shows the siqierposition of the sjpectra of FIGURE 16. The 
two panels show two different scales and the spectra analyzed at that scale. 

FIGURE 18 shows the superimposed MALDI-tOF spectra fit>m MALDI* 
MS analysis of mass-modified oligonucleotides as desoibed in Example 21. 

FIGliRE 19 ilhistrates various linking chemistries between the solid 
support (P) and the nucleic acid primer (NA) through a strong electrosutic interaction. 

FIGURE 20 illustrates various linking chemistries between the solid 
support (P) and the nucleic acid primer (NA) through a charge transfer complex of a 
charge transfer acceptor (A) and a charge transfer donor (D). 

FIGURE 21 illustrates various linking chemistries between the solid 
support (P) and the nucleic add primer (NA) through a stable organic radical. 

FIGURE 22 illustrates a possible linking chemistry between the solid 
support (P) and the nucleic add primer (NA) through Watson-Crick base pairing. 

FIGURE 23 ilhistrates Imking the solid support (P) and the nucleic acid 
primer (NA) through a photolytically cieavabie bond. 

FIGURE 24 shows the portion of the sequence of pRFcl DNA, which 
was used as template for PGR amplification of immodified and 7-deazapurine containing 
99-mer and 20d-mer nudeic adds as well as the sequences of the 19-mer primers and the 
two 18-mer reverse primers. 

FIGURE 25 shows the portion of the nudeotide sequence of M13{inpl8 
RFI DNA, which was used for PGR amplification of unmodified and 7-dea2apurine 
containing 103-mer nucleic acids. Also shown are nucleotide sequences of the 17-mer 
primers used in the PGR. 
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FIGURE 26 shows the result of a polyacrylamide gei dectrophoresis of 
PCR products purified and concentrated for MALDI-TOF M S analysis: M: . chain length 
marker, lane 1 : 7-deazapurine containing 99Hner PCR product^ lane 2: unmodified 99- 
mer, lane 3: 7-deazapufine containing 103-mer and lane 4: unmodified 103-mer PCR 
product. 

FIGURE 27: an autoradiogram of polyacrylamide gd electrophoresis of 
" PCR reactions carried out with 5*-[''^]-labeled primers 1 and 4. Lanes 1 and 2: 
uranocfified and 7 -deazapurine modified 103-mer PCR product (S3321 and 23520 
counts), lanes 3 and 4: unmodified and 7-deazapurine modified 200-mer (71 123 and 
39S82 counts) and lanes S and 6: unmodified and 7-deazapurine modified 99-mer 
(173216 and 94400 counts). 

nGURE28: a) MALDI-TOF mass spectrum of the unmodified 103-mer 
PCR products (sum of twelve angle shot spectra). The mean value of the masses 
calculated for the two single strands (31768 u and 31759 u) is 31763 u. Mass resolution: 
18. b) MALDI-TOF mass spectrum of T-deazapurine containing 103.mer PCR product 
(sum of three single shot spectra). The mean value of the masses calculated for the two 
single strands (3 1727 u and 3 1719 u) is 3 1723 u. Mass resolution: 67. 

nGURE29: a) MALDI-TOF mass spectrum of the unmodified 99-mer 
PCR product (sum of twenty single shot spectra). Values of the masses calculated for the 
two ^gle strands: 30261 u and 30794 u. b) MALDI*TOF mass spectrum of the 7- 
deazapurine containing 99-mer PCR product (sum of twdve single shot spectra). Vahies 
ofthe masses calculated for the two single strands: 30224 u and 30750 u. 

FI(jURE30: a) MALDI-TOF mass ^ectnim ofthe unmodified 200-mer 
PCR product (sum of 30 single shot spectra). The mean value of the masses calculated 
for the two single strands (61873 u and 6159Su) is 61734 u. Mass resolution: 28. b) 
MALDI-TOF mass spectrum of 7-dea2apurine containing 200-mer PCR product (sum of 
30 single shot spectra). The mean value of the masses calculated for the two single 
strands(61772u and 61SI4u) is 61643 u. Mass resolution: 39. 

FIGURE 3 1 : a) MALDI-TOF mass spectrum of 7-dea2apurine containing 
lOO-mer PCR product with ribomodified primers. The mean value ofthe masses 
calculated for the two single strands (30529 u and 3 1095 u) is 30812 u. b) MALDI-TOF 
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mass spectrum of the PCR-product after hydrolytic primer-cleavage. The mean value of 
the masses calculated for the two single strands (25 1 04 u and 2S229 u) is 25-1 67 u. The 
mean value of the cleaved primers (5437 u and 59 1 8 u) is 5677 u. 

FIGURE 32 A-D shows the MALDI-TOF mass spectrum of the four 
5 sequencing ladders obtained from a 39-mer template (SEQ. ID. No. 13), which was 

immobilized to streptavidtn beads via a 3' biotinylation. A I4-mierprimer (SEQ. ID. NO. 
14) was used in the seqiiendng. 

FIGURE 33 shows a MALDI-TOF mass spectrum of a solid state 
sequencing of a 78«mer template (SEQ. ID. No. 1 S), which was immobilized to 
10 streptavidinbeads viaa3Vbiotinyiation. A lS-merprimer(S£Q IDNd: 16)acidddGTP 
were used in the sequencing. 

FIGURE 34 shows a scheme in wMch dupliex DNA probes with single- 
stranded overiiang capture specific DNA templates and also scove as primers for solid 
state sequencing. 

15 FIGURE 3 5A-D shows MALDI-TOF mass spectra-obtained from ay 

fluorescent labeled 23-mer (SEQ ID. No. 19) annealed to an 3* biotinylated I8^mcr 
(SEQ. ID. No. 20), leaving a 5-base overhang, which captured a 15-mer template (SEQ. 
ID. No. 21). 

FIGURE 36 shows a stacking flurogram of the same products obtained 
20 from the reaaton described in FIGURE 35, but run on a conventional DNA sequencer. 

Detailed Pcscriptfon of the Invention 

This imrentiofi describes an improved method of sequoidng DNA. In 
particular, tlus invention employs mass q>ectrometry to analyze the Sanger sequencing 
25 reaction mixtures. 

In Sanger sequencing, four femilies of chain-terminated fragments are 
obtained. The mass difference per nudeotide addition is 289. 19 for dpC, 313:21 fordpA, 
329.21 for dpG and 304.2 for dpT, respectively 

In one embodiment, through the separate determination of the molecular 
30 weights of the four base-specifically tenninated fragment families, the DNA sequence can 
be assigned via superposition (e.g.> interpolation) of the molecular weight peaks of the 
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four individual experiments. In another embodiment, the molecular weights of the four 
spedfically^ terminated fragment &miGes can be detennined simultaneously by MS, either 
by mixing the products of aD four reactions run in at least two separate reaction vessds 
O-C, all run separately, or two together, or three together) or by runnmg one reaction 
having all four chain-terminating nucleotides (e.g., a reaction mixture comprising dTTP, 
ddTTP, dATP, dd ATP, dCTP, ddGTP, dGTP, ddGTP) in one rseaction vessel. By 
' sifimhaneously analyzbig all four base-specifically termimited reaction products, the 
molecular weight values have been, in ^ect, interpolated. Comparison of the mass 
difference measured between fragments with the known masses of each chain-terminating 
nucleotide allows the assignment of sequence to be carried out. In some instances, it may 
be desirable to mass modify, as discussed below, the chain-terminating nucleotides so as 
to expand the difference in molecular weight between each nucleotide. It will be apparent 
to those skilled in the art when mass-modification of the chain-terminating nucleotides is 
desirable and can depend, for instance, on the resolving ability of the particular 
spectrometer employed. By way of example, it may be desirable to produce four chain- 
terminating nucleotides, ddTTP, ddCTp\ ddATP^ and ddGTP^ where ddCTp' 
2 3 

ddATP and ddGTP have each been mass-modified so as to have molecular wdghts 
resolvable from one another by the particular spectrometer being used. 

The teims chainrelongating nucleotides and chain-terminating nucleotides 
are well known in the art. For DNA, chain-elongating nucleotides include 
2 -deoxyiibonucleotides and chain-terminating nucleotides include 
2', 3'.dideoxyribonucleotides. For RNA, chain-elongating nucleotides indude 
ribonucelotides and chain-terminating nucleotides include 3*-deoxyribonucleotides. The 
term nucleotide is also well known in the art. For the purposes of this invention, 
nucleotides inchide nucleoside mono-, di-, and triphosphates. Nucleotides also include 
modified nucleotides such as phosphorothioate nucleotides. 

Since mass spectrometry is a stuizi method, in contrast to currently used 
slab gel dectrophoresis which allows several samples to be processed in parallel, in 
another embodiment of this invention, a finther improvement can be achieved by 
multiplex mass spectrometric DNA sequencing to allow simultaneous sequencing of more 
than one DNA or RNA fragment. As described in more detail below, the range of about 
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300 mass units between one nucleotide addition can be utilized by employing either mass- 
modified nucleic acid sequencing primers or chain-elongating and/or terminating 
^ nucleoside triphosphates so as to shift the molecular weight of the base-specifically 
terminated firagments of a particular DNA or RNA species being sequenced in a 
predetermined manner. For the first time, several sequendng reactions can be mass 
spectrometricalty analyzed in parallel. In yet another embodimrat of this invention, 
multiplex mass spectrometric DNA sequencing can be performed by mass madi^g the 
fi:agment families through specific oligonucleotides (tag probes) which hybridize to 
specific tag sequences within each of the fitigmoit families. In ahother embodiment, the 
tag probe can be covaleiitly attached to the individual and spedfic tag sequence prior to 
mass spectrometiy. 

Preferred mass spectrometer formaits for use in the invention are matrbc 
assisted laser desorption ionization (MALDl), electrospray (ES), ion cyclotron resonance 
(ICR) and Fourier Transform. For ES, the samples, dissolved in wat^ or in a volatile 
buffer, are injected either continuously or discontinuously into an atmospheric pressure 
ionization interface (API) and then mass analyzed by a quadrupole. The generation of 
multiple ion peaks which can be obtained using ES mass spectrometry can increase the 
accuracy of the mass determination. Even more detailed information on the specific 
structure can be obtained using an MS/MS quadrxipole configuration 

In MALDI mass spectrometiy, various mass analyzers can be used, e.g., 
magnetic sector/magnetic deflection instruments 'm single or triple quadrupole mode 
(MS/MS), Fourier transform and time-olF-flight (TOF) configurations as is known in the 
art of iiiass spectrometry. For the deisorptson/ionization process^ numerous matrix/laser 
combinations can be used. Ion-trap and reflectron config^iratioris can also be employed. 

In one embodiment of the invention, the molecular weight values of at 
least two base-specifically terminated fragmmts are determuied concurrently using mass 
spectrometry. The molecular weight values of preferably at least five and more 
preferably at least ten base-specifically terminated fragments are deternlihed by mass 
spectrometry. Also included in the invention are determinations of the molecular weight 
values of at least 20 base-specifically terminated firagments and at least 30 base- 
specifically terminated fragments. Further, the nested base-spedfically terminated 
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fiiQgments in a spedfic set can be purified of all reactants and by-products but are not 
sqKtrated from one another. The entire set of nested base-spedfically^tenninated 
fiagments is analyzed ccmcunientiy and the molecular wdght values are deternu At 
least two base-spedficaily terminated fragments are analyzed concurrently by mass 
spectrometry when the fragments are contained in the same sample. 

In genual, the overall mass spectrometric DNA sequencing process will 
- -start with a library of small genonuc fragments obtained after first randomly or 
specifically cutting the genomic DNA into large pieces which then, in several subcloning 
s^eps, are reduced in size and inserted into, vectors Gke derivatives of Ml 3 or pUC (e.g., 
M13mpl8 or Ml3mpl9) (see FIGURE!). In a difierent approach, the fragments 
inserted in vectors, such as M13, are obtained via subcloning starting with a cDNA 
Ubraiy. In yet another approach, the DNA fragments to be sequenced are generated by 
the polymerase chain reaction (e.g., Higuchi eial., "A General Method of in vitro 
Preparation and Mutagenesis of DNA Fragments: Study of Protein and DNA 
imeractions." Nucleic Acids , 16, 7351-67 (1988)). As is known in the art, Sanger 
sequencing can start from one nucleic acid primer (UP) binding to the plus^strand or from 
another nucleic acid primer binding to the opposite minus-strand Thus, either the 
complementary sequence of both strands of a given unknown DNA sequence can be 
obtained (providing for reduction of ambiguity in the sequence determination) or the 
length of the sequence information obtaimible from one clone can be extended by 
generating sequence information from both ends of the unknown vector-inserted DNA 
. fragment 

The nucleic add primer carries^ preferentially at the S'-epd, a linking 
fimctionaBty, L, which can inchide a spacer of suflBcient length and which can interact 
with a suitable fiinctionality, L', on a solid support to form a reversible linkage such as a 
photocJeavable bond. Since each of the four Sanger sequencing families starts with a 
nudeic add primer (L-UP; FIGURE 1) this fragment £miily can be bound to the solid 
support by reacung with funcUonal groups. L', on the surfece of a solid support and then 
imensively washed to remove all buffer sahs, triphosphates, enzymes, reaction by- 
products, ^c. Furthermore, for mass spectrometric analysis, it can be of importance at 
this stage to exchange the cation at the phosphate backbone of the DNA fragments in 
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order to eliminate peak broadening due to a heterogeneity in the cations bound per 
nucleotide unit. Since the L-U linkage is only of a temporary nature with Ae purpose to 
capture the nested Sanger DNA or RNA fragments to properly condition than for mass 
spectroinetric analysis, there are different chemistries ^ch can serve this purpose. In 
5 adcUtion to the examples ^ven in viihith the nested fragments are coupled covalently to 
the solid support, washed, and cleaved off the wpport for mass spectrometric analysis, 
the temporaty linkage caii be such that it is cleaved under the condhions of mass 
spectrometry, i.e., a photocleaviable bond such as a charge ixan^er complex or a stable 
organic radicai: Furthermore,^ linkagecan be formed with L'bdng a quaternary 

10 aihmoiiium group (some examples are ffvm in FIGURE 19). In this case, preferably^ the 
sur&ce of the solid support carries negative charges which repel the negatively charged 
nucl^c acid backbone aiid thus facilitates desorptton. I)esorption will take place either 
by the heat created by the laser pulse aiid/or, depending on L,* by spedfic absorption of 
laser energy which is in resonance with the V chromophore (see, e.g., examples given in 

15 FIGURE 19). The functionalities, L and L* can also form a charge transfer complex and 
thereby form the temporary L-U linkage. Various examples for appropriate 
functionalities with either acceptor or donator propierties are depicted without limitation 
in FIGURE 20. Since in many cases the "charge-transfer baiid" can be determined by 
UV/vis spectrometry (see e.g. Organic Char ge Transfer Complexes by R. Foster, 

20 Academic Press, 1 969), the laser energy can be tuned to the corresponding energy of the 
chai:ge-transfer wiiveler^h and, thus, a specific desorption off the solid support can be 
iiutiated. Those stalled in the art will recognize that several combinations can serve this 
purpose and that the donor functionality can be either on the solid support or coupled to 
the nested iSanger DNA/RNA fragments or vice verssi, 

25 III yirt another approach, the tOTporaiy linkage L-U can be genei^ 

homoiytically fonning relatively stable radicals as exemplified in FIGURE 21. In example 
4 of FIGURE 21; a combination of the approabhes u»ng charge-transfer complexes and 
stable organic radicals is shown. Here, the nested Sang^ DNA/RNA fragmems are 
captured via the formation of a charge transfer complex. Under the influence of the laser 

30 pulse, desorption (as discussed above) as well as ionization will take place at the radical 
position. In the other examples of FIGURE 21 under the influence of the laser pulse, the 
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L-L* linkage wiU be cleaved and the nested Sanger DNA/RNA fragments desorbed and 
subsequent^ ionized at the radical position fomied. Those skilied in ihe-art will 
recognize that other ciganic radicals can be sdected and that, in relation to the 
dissociation energies needed to homolytically deave the bond between them, a 
corresponding laser wavelength can be selected (see e.g. Reactive Mftl«>t»tii«.« i>y c. 
Wentnip. John Wiley St Sons, 1984). In yet another approach, the nested Sanger 
■ ' DNA/IiNA fragments are captured via Watson^Jridc base pairing to a solid support- 
bound oligonucleotide complementary to either the sequence of the nucleic acid primer or 
the tag oligonucleotide sequence (see FIGURE 22). The duplex formed wfll be cleaved 
under the influence of the laser pulse and desorption can be initiated. The solid support- 
bound base sequence can be presented through natural oligoribo- or 
oligodeoxyribonudeotide as wdl as analogs (e.g. thio-modified phosphodiester or 
phosphotriester backbone) or employing oligonucleotide miraetics such as PNA analogs 
(see e-g. Nidsen et a/., ScifincB, 254, 1497 (1991)) which render the base sequence less 
susceptible to enzymatic degradation and hence ino-eases overall stability of the solid 
support-bound capture base sequence. Wth appropriate bonds, L-L', a cleavage can be 
obtained directly with a laser tuned to the energy necessary for bond cleavage. Thus, the 
immobilized nested Sanger fi^ents can be directly ablated during mass spectiometric ' 
analysis. 

Prior to mass spectrpmetric analysis, it may be usefiil to "condition" 
nudeic add molecules, for example to decrease the bser energy requited for vobtiration, 
to nunimize fragmentation or to otherwise increase the sensitivity of mass qiectrometeric 
detection. For example, nuddc adds can be "conditioned" by adding positive or 
negative charges, i.e. charge ugs (CTs). CTs increase the mass spectrometer detection 
sensitivity by increaang the d^e of ionization during the mass spectrpmetric 
(e.g.MALDI) process. A CT can be linked either to the external 3' or 5' position or 
internally e.g. at the 2* position or at the base, e.g. at C-5 in uradl, C-5 methylgroup of 
thymine, C-5 at qnosine, at C or C* of guanine, adenine and hypoxanthine o r at the 
phosphate ester moiety. Charge tags, CTs, can function molecules with permanent (i.e. 
pH-independent) ionization, such as: 
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or molecules which generate a positive charge upon MALDI and which are stabilized by 
delocalization of the positive diarge by inesorneric effects in unsaturated and/or aromatic 
systems such as: 




wherein. 



R, K\ = H,OAI (wherein Al= e.g. lower alkyl, methyl, 
ethyl, propyl), NO2, CR COjH, COj active ester, br 
halogen; and 

X = -0-, -NH-, -S-, C=0, OCO cither in the para br meta 
position. 



For example, the positive charge of a trityl cation is produced during MALDI by the 
15 removal of a moirty such as: -OR, where R = a lovirer alkyl, or an anion such as CIO/, 
SbF/, BF/ and the like. 



20 
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In an alternative scheme, the trityl group is used to anchor the 
oligonucleotide to a solid support via the tertiary carbon and this bond is cleaved during 
mass spectrometry (e.g. MALDI), leaving a positive charge on the desorbing and high 
vacuum flying oligonucleotide. 



30 



One of skill in the art can readily appreciate several Agnations to the schenies described 
above. In ad(Btion to emplojdng the charge tag array alone, one of skill in the art can 
employ a charge tag array in conjunction wth another conditioning means. Particularly 
preferred means to be used in conjunction with the CT include treating the 
phosphodiester bond with trialkylsilyl halides or the phosphomonothiodiester bond with 
alkyiiodides to render the polyanionic backbone neutral. 
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Another example of conditioning is modification of the phosphodiester 
backbone of the nucleic add molecule (e.g. cation exchange), which can be useful for 
eliminating peak broadening due to a heterogeneity in the cations bound per nucleotide 
unit In addition, a nucleic add molecule can be contacted with an all^bting agent such 
as all^liodide, todoacetamide, p^iodpethanol, or 2,3-epoxy-l-propanol, the monothio 
phosphodiester bonds of a middc add molecule can be transformed into a 
' pfioi^hotriester bond. Likewise, phosphodiester bonds may be transfonned to imcharged 
derivatives employing trialkylsilyl chlorides. Further conditioning involves incorporating 
nudeotides which reduce sensitivity for depurination (fragmentation during MS) such as 
N7- or N9-deazapurine nudeotides, or RNA building blocks or u^g oligonucleotide 
triesters or incorporating phosphorothioate &nctions which are alkylated or employing 
oligonucleotide mimetics such as PNA. 

Modification of the phosphodiester backbone can be accomplished by, for 
example, using alpha-thio modified nucleotides for chain elongation and termination. 
With alkylating agents such as akyliodides, iodoacetamide, P-iodoethanol, 2,3-epoxy-l- 
propanol (see FIGURE 10), the monothio phosphodiester bonds of the nested Sanger 
fragments are transformed into phosphotriester bonds. Multiplexing by mass 
modification in this case is obtained by mass-modifying the nucleic acid primer (UP) or 
the nucleoside triphosphates at the sugar or the base moiety. To those skilled in the art, 
other modifications of the nested Sanger fragments can be envisioned. In one 
embodiment of the invention, the linking chemistry allows one to deave off the so- 
purified nest^ DNA enzymatically, chemically or physically. By way of example, the L- 
L' chenustry can be of a type of disulfide bond (chemically cleavable, for example, by 
mercaptoethanol or dithioerythrol), a biotin/strepta^adin system, a heterobifimctional 
derivative of a triiyl ether group (Koster etaL, "A Versatile Acid-Labile Linker for 
Modification of Synthetic Biomolecules," Tetrahedron Letters n, 7095 (1990)) w*ich 
can be cleaved under mildly acidic conditions, a ievulinyl group cleavable under almost 
neutral conditions with a hydrazinium/acetate buffer, an arginine-arginine or lysine-lysine 
bond cleavable by an endopeptidase enzyme like trypsin or a pyrophosphate bond 
cleavable by a pyrophosphatase, a photocleavable bond which can be, for example, 
physically cleaved and the like (see. e.g., FIGURE 23), Optionally, another cation 
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exchange can be performed prior to mass spectrometric analysis. In the instance that an 
enzyme-cleavable bond is utilized to immobilize the nested fragments, the enzyme used to 
cleave the bond can serve as an internal mass standard during MS analysis. 

The purification process and/or ion exchange process can be carried out by 
a number of other methods instead o^ or in conjunction with, immobilization on a sofid 
support. For example, the base-specifically terminated (H-oducts can be separated fix>m 

reactants by dialysis, filtrarion Oncluding ultrafihrationX and chibmatogra|>hy. 
' Likewise, these techniques can be used to exchange the cation of the phosphate badd)one 
with a counter*son which reduces peak broadening. 

The base-spedficaUy ternunated fragment femilies can be general 
standsu*d Sanger sequencing using the Large Klenow fragment of £1 cdli DNA 
polymerase I, by Sequenase, Taq DNA polymerase and Other DNA polymerases suitable 
foi this purpose, thus generating nested DNA frajgments for the mass spectrometric 
analysis: It is, however, part of this invention that base-specifically terminated RNA 
transdipts of the DNA fragments to be sequenced can also be utilized for mass 
spectrometric sequence determination. In this case, various RNA polymerases such as 
the SP6 or the T7 RNA polymerase can be used on appropriate vectors containing, for 
example, the SP6 or the T7 promoters (e g. Axelrod ei al, "Transcription fi-om 
Bacteriophage T7 and SP6 RNA Polymerase Promoters in the Presence of 3- 
Deoxyribonucleoside S*-tripho5phate Chain Terminators/ Biochemistry 24, 5716-23 
( 1 985)). In this case, the unknown DNA sequence fragments are inserted downstream 
from such promoters. Transcription can also be initiated by a nucleic acid primer (PituUe 
ei aL^ "Initiator Oligonucleotides for the Combination of Chemical and Enzymatic RNA 
Synthesis;^ fifiOfi 112, 101-lOS (1992)) which carries^ as one embodiment of this 
tnventioh; appropriate finking fimctionalities, L, which aDow the immobilization of the 
nested RNA fragments, as outlined above, prior to mass spectrometric analysis for 
purification and/or appropriate modification and/or conditioning. 

For ttus immobilization process of the DNA/RNA sequencing products for 
nuiss spectrometric ^naly^s, various soUd supports can be used, e:g., beads (^ca gel, 
controlled pore glass, magnetic beads, Sephadex/Sepharose beads, cellulose beads, etc.), 
csq>iUaries, glass fiber filters, glass surfaces, metal surfaces or plastic material. Examples 
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of useful plastic materials include membranes in filter or microtiter plate formats, the 
latter allowing the automation of the purification process by employing microtiter plates 
which, as one embodiment of the invention, carry a permeable membrane in the .bottom of 
the well fiinctionalized with L'. Membranes can be based on polyethylene, polypropylene, 
polyamide, polyvinylidenedifluoride and the like. Examples of suitable metal surfaces 
include sted, gold, silver, aluminum, and copper. After purification, cation exchange, 
' anB^br modification of the phosphodsester backbone of the L-U bound nested Sanger 
firagments, they can be cleaved off the solid support ch^cally, enzymatically or 
phystcally.> Also, the lAJ bound fi^agments can be cleaved finom the support when they 
are subjected to mass spectrometric analysis by using appropriately chosen L-L* linkages 
and corresponding laser energies/intensities as described above and in FIGURES 19-23. 

The highly purified, four base-specifically terminated DNA or RNA 
fragment families are then analyzed with regard to their fi^agment lengths via 
determination of their respective molecular weights by MALDI or ES mass spectrometry. 

For ES, the samples, dissolved in water or in a volatile buffer, are injected 
either continuously or discontinuously into an atmospheric pressure ionization interface 
(API) and then mass analyzed by a quadrupole. With the aid of a computer program, the 
molecular weight peaks are searched for the known molecular weight of the nucleic acid 
primer (UP) and determined ^ch of the four chain-tarminating nucleotides has been 
added to the UP. This represents the first nucleotide of the unknown sequence. Then, 
the second, the third, the n extension product can be identified in a similar manner and, 
by this, the nucleotide sequence is assigned. The generation of multiple ion peaks which 
can be obtained using ES mass spectrometry can ino-ease the accuracy of the mass 
determination. 

In MALDI mass spectrometry, various mass analyzers can be used, e.g., 
magnetic sector/magnetic deflection instruments in single or triple quadrupole mode 
(MS/MS), Fourier transform and time-of-flight (TOF) configurations as is known in the 
art of mass spectrometry. FIGURES 2 A through 6 are given as an example of the data 
obtainable when sequoicing a hypothetical DNA fi-agment of 50 nucleotides in length 
(SEQ ID N0:3) and having a molecular weight of 15,344.02 daltons. The molecular 
weights calculated for the ddT (FIGURES 2A and 2B), ddA (FIGURES 3 A and 3B), 
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ddG (FIGURES 4A and 4B) and ddC (FIGURES 5A and SB) tenninated products are 
given (corresponding to fi-agments of SEQ ID N0:3) and the idealized four^VlALDI-TOF 
mass spectra shown. All four spectra are superimposed, and from this, the DNA 
sequence can be generated. Thb is shown in the summarizing FIGURE 6, demonstrating 
S how the molecular wdghts are correlated with the DNA sequoice. MALDI-TOF spectra 
have been generated for the ddT terminated products (FIGURE 16) corresponding to 
Aose shown' in FIGURE 2 and these spectra have been superimposed (FIGURE 1 7). 
The correlation of calculated molecular weights of the ddT fragments and thdr 
experimentally-verified wdghts are shown in Table 1. Ukewise, if all fcMir chain- 
10 terminating leaictions are combined and dlen ainalyzed by mass spec^ 

molecular weight difference between two adjacem peaks can be used to determine the 
sequence. For the desorption/ionization process, numerous matrix/laser combinations 
canbeused. 



Correlation of calculated and »cperimentally verified molecular weights of the 13 DNA 

fi-agments of FIGURES 2 and 16. 



15 



TABLE I 



20 



Fragment 
(n-mer) 



calculated mass 



ecperimental mass 



difference 



30 



25 



7-mcr 

10- mer 

11- roer 

19- mer 

20- mer 
24-mer 
26-mer 
33-mer 

37- mer 

38- mer 
42-mer 
46-mer 
50-mer 



2104.45 

3011.04 

3315.24 

5771.82 

6076.02 

7311.82 

7945.22 

10112.63 

11348.43 

11652.62 

12872.42 

14108.22 

15344.02 



2119.9 

3026.1 

3330.1 

5788.0 

6093.8 

7374.9 

79(50.9 

10125.3 

11361.4 

11670.2 

12888.3 

14125.0 

15362.6 



+15.4 
+15.1 
+14.9 
+16.2 
+17.8 
+63.1 
+15.7 
+12.7 
+13.0 
+17.6 
+15.9 
+16.8 
+18.6 
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In order to mcrease throu^ut to a level necessary for high yotume 
genomic and cDNA sequencing projects, a further embodiment of the present invention is 
to.utilize multiplex mass spectrometry to simultaneously determine more than one 
sequence. This can be achieved by several, albeit diflFerent, methodologies, the basic 
principle being the mass modification of the nucleic acid primer (UP), the chain- 
elongating and/or terminating nucleoside triphosphates, or by using mass-differentiated 
' i^ prob^ hybii^tzable to specific tag sequences. The term "nucleic acid primer" as used 
herein encompasses primers for both DNA and RNA Sanger sequencing. 

By way of example, FIGURE 7 presents a general formula of the nucleic 
acid primer (UP) and the tag probes (TP). The mass modifying moiety can be attached, 

for instance, to either the 5'-end of the oligonucleotide (M^, to the nucleobase (or bases) 

2 7 3 
(M , M X to the phosphate backbone (M ), and to the 2 -position of the nucleoside 

4 6 5 
(nucleosides) (M , M ) or/and to the terminal 3 ^-position (M ). Primer length can vary 

between 1 and 50 nucleotides in length. For the priming of DNA Sanger sequencing, the 
primer is preferentially in the range of about 15 to 30 nucleotides in length. For 
artificially printing the transcription in a RNA polymerase-mediated Sanger sequenciiig 
reaction, the length of the prirner is preferentially in the range of about 2 to 6 nucleotides. 
If a tag probe (TP) is to hybridize to the integrated tag sequence of a femily chain- 
terminated fragments^ its preferential length is about 20 micleotides. 

The table in FIGURE 7 depicts some examples of mass-modified primer/tag 
probe configurations for DNA, as well as RNA, Sanger sequencing. This list is, however, 
not meant to be limiting, since numerous other combinations of mass*modi^g functions 
and positions mihin the oligonucleotide molecule are possible and are deemed part of the 
invention; The mass-modifying functionality can be, for example, a halogen, an azido, or 
of the type, XR, wherrin X is a linking group and R is a mass-rnodifying functionality. 
The mass*modifying functionality can thus be used to introduce defined mass increments 
irito the oligonucleotide molecule. 

In another embodiment, the nucleotides used for chain-elongation and/or 
termination are mass-modified. Examples of such modified nucleotides are shown in 
FIGURE 8. Here the mass-modifying moiety, M, can be attached either to the 
nucleobase, (in case of the c -deazanucleosides also to C-7, m\ to the triphosphate 



IVO $7/37041 



PCT/US97/04394 



-28. 

group at the alpha phosphate, , or to the 2 -portion of the sugar ring of the nucleoside 
4 6 

triphosphate, M andM . Furthennore, the nms*modtfying fonctioriaiily cSui be added 

so as to affect diain termination, such as by attaching it to the 3 ^position of the sugar 

- -5 

ring in the nucleoside triphosphate, M . The list in FIGURE 8 represents examples of 
S pos»ble configurations for generating chath*terminating nucleoside triphosphates for 
RNA or DNA Sanger seiquendng. For those skiOed in the art, however, it is clear that 
many other combinations can serve the purpose of the invehtioii equally well. In the 
same way, those skilled in the art will recognize that chain-^ohgating nucleoside 
triphosphates can also be mass-modified in a simflar fii^oii with numerous variations and 

10 combinations in fonctiohklity and attadmiem p 

Without limiting the scope of the invention, FIGURE 9 gives a more 
detailed description of particular examples of how the mass*modification, can be 
introduced for X in XR as well as using oligo-/polyethylene glycol derivatives for R The 
mass-modifying increment in this case is 44, i.e. five different mass-modified species can 

15 be generated by just changing m fi-om 0 to 4 thus adding mass units of 45 (m=0); 89 
(nT=l), 133 (m=2X 177 (m=3) and 221 (m=4) to the nucleic acid primer (UP), the tag 
probe (TP) or the nucleoside triphosphates respectively. The oligo/polyethylerie glycols 
can also be mohoalkylated by a lower alkyl such as methyl, ethyl, propyl, isopropyl, t- 
butyl and the like. A selection of linking fimctionalities, X, are also illustrated. Other 

20 cheitustries can be used in the mass-^modified compounds, as for sample, those described 
recently in Qligotiucledtides and Analogues. A Practical Apnroach. F^ Eckstein, editor, 
IRL Press, Oxford, 1991. 

In y^ another embodiment, various niass-modifying flinctionatities, R, other 
than oligo/polyethylene glycols, can be selected and attached via appropriate linking 

25 chemistries, X. Without any limitation, some examples are given in FIGURE 10. A 
simple mass-modification caii be achieved by substituting H for halogens like F, CI, Br 
and/or 1, or pseudohalogens such as SCN, NCS, or by using different alkyl, aryl or aralkyl 
moieties such as methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, phenyl, substituted 
phenyl, benzyl, or functional groups such as CH2F. CHF2, CF3, Si(CH3)3, 

30 Si(CH3)2(C2H5), Si(CH3)(C2H5)2, Si(C2H5)3 . Yet another mass-modification can be 
obtained by attaching homo- or heteropeptides.through X to the UP, TP or nucleoside 
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triphosphates. One example useful in gemrattng mass-modified spedes with a mass 
increment of 57 is the attachment of oligoglycines, e.g., mass-modtflcaticms of 74 (r=l, 
m=0), 131 (r-U m=2), 188 (r=l, m=3). 245 (r=l. m=4) are achieved. Simple 
oligoamides also can be used, e.g., mass-modifications of 74 (f=.l, m=0), 88 (r=2. m=OX 
102 (r=3, m=0), 1 16 (r=4, wfO). etc. are obtainable. For those skilled in the art. it wiU 
be obvious that there are numerous possibilities in addition to those given in FIGURE 10 
- and the above mentioned reference fOligonndentides and Amlngu^c, v Eckstein, 1991), 
for inurodudng, in a predetermined manner, many different mass-modifying fijnctionalittes 
to UP, TP and nucleoside triphosphates which are acceptable for DNA and RNA Sanger 
sequencing. 

As used herein, the superscript 0-i designates i + 1 mass differentiated 

nucleotides, primers or tags. In some instances, the superscript 0 (e.g., NTP° UP^) can 

designate an unmodified species of a particular reactant, and the superscript i (e.g., NTp\ 
1 2 

NTP , NTP , etc.) can designate the i-th mass-modified species of that reactant. If, for 
example, more than one species of nucleic acids (e.g., DNA clones) are to be 
concurrently sequenced by multiplex DNA sequencing, then i + 1 different mass-modified 
nucleic acid primers (UP^, UP^'...UP*) can be used to distinguish each set of base- 
specifically terminated fi-agments, wherein each species of mass*modified Up' can be 
distinguished by mass spectrometry from the rest. 

As illustrative embodiments of this invention, three different basic processes 
for multiplex mass spearometric DNA sequencing employing the described mass- 
modified reagents are described below: 

A) Multiplexing by the use of mass-modified nucleic acid primers 
(UP) for Sanger DNA or RNA sequencmg (see for example FIGURE 1 1); 

B) Multiplexing by the use of mass-modified nucleoside 
triphosphates as chain elongators and/or chain terminators for Sanger 
DNA or RNA sequencing (see for example FIGURE 12); and 

C) Multiplaing by the use of tag probes which specifically 
hybridize to tag sequences which are integrated into part of the four 
Sanger DNA/RNA base-specifically terminated fi-agment families. Mass 
modification here can be achieved as described for FIGURES 7, 9 and 10, 
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or alternately, by designing different oligonucleotide sequences having the 

same or different length with unmodified nucleotides which, in a - 

predetermined way, generate appropriately differentiated molecular 

weights (see for example FIGURE 13): 

S The process of multiplexing by mass-modified micleic acid primers (UP) is 

illustrated by way of example in FIGURE 11 for mass analyzing four different DNA 

clones amuitaneously. The firist reaction nuxture is obtained by ^andard Sanger DNA 

sequehdng having unknown DNA fi^agment 1 (clone 1) integrated in an appropriate 

vector (e.g:, M13mpl 8), employing an unmodified niicleic acid primer UP^, and a 

10 standard mbcture of the four unmodified deoxynucleoside triphosphates, dNTP^, and 

with l/lMh ofoneofthefourdideoxynudeddde triphdsphaitc^ A second 

reaction mixture for DNA fi^gment 2 (clone 2) is obtained by employing a mass-modified 

nucldc add primer UP^ and, as before, the four unmodified nudeoade triphoisphates, 
0 

dNTP , containing in each separate Sanger reaction l/10th of the chain-terminating 

1 S unmodified dideoxynucleoinde triphosphates ddNTP^. In the other two experiments, the 

2 

four Sanger reactions have the following compositions: DNA firagment 3 (clone 3), UP , 

dNTP^, ddNTP^ and DNA fiagment 4 (done 4), Up'. dNTP^, ddNTP^. For mass 

spectrometric DNA sequencing, all base-specifically terminated reactions of the four 

clones are pooled and mass analyzed. The various mass peaks belonging to the four 

20 dideoxy-terminated (e.g., ddT-terminated) fi^gment families are assigned to specifically 

elongated and ddT-terminated fi-agments by searching (such as by a computer program) 

for the known molecular ion peaks of UP^, Up' , UP^ and UP"^ extended by either one of 

the four dideoxyrtudcoside triphosphates, UP^-ddN^, UP^-ddiN^, UP^-ddN^ and UP"^- 
0 

ddN . In this way, the first nucleotides of the four unknown DNA sequences of done ) 
25 to 4 are determined. The process is repeated, having memorized the molecular masses of 
the four spedfic first extension products, until the four sequences ve assigned. 
Unambiguous mass/sequence assignments are possible even in the worst dase scraario in 
which the four mass-modified nucleic acid primers are extended by the same 

0 

dideoxynudeoside triphosphate, the extension products then being, for example, UP - 

12 3 
30 ddT, UP -ddT, UP -ddT and UP -ddT, which differ by the known mass increment 

differentiating the four nucldc acid primers. In another embodiment of this invention, an 
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analogous technique is employed using different vectors containing, for csxample, the SP6 

and/or T7 promoter sequences, and performing transcription with the nucleic acid 

0 12 3 
primers UP , UP , UP and UP and efther an RNA polymerase (e.g., SP6 or T7 RNA 

polyroerase) with chain-elongating and terminating unmodified nucleoside triphosphates 
0 0 

NTP and 3'-dNTP Here, the DNA sequence is bemg determined by Sanger RNA 
sequencing. 

FIGURE 12 illustrates the process of multiplexing by mass-modified chain- 
elongating or/and terminating nucleoside triphoq>hates in whidi three diffmnt DNA 
fragments (3 clones) are mass analyzed simultaneously. The first DNA Sanger 
sequencing reaction (DNA fiagment 1, clone 1) is the standard mixture employing 
unmodified nucleic add primer UP^. dNTP^ and in each of the four reactions one of the 
four ddNTP . The second (DNA fragment 2, clone 2) and the third (DNA fiagment 3, 
clone 3) have the following contents: UP^, dNTP^, ddNTP ^ and UP^, dNTP^, ddNTP^ 
, respectively. In a variation of this process, an amplification of the mass increment in 

mass-modifying the extended DNA fi-agments can be achieved by either using an equally 

12 

mass-modified deoxynucleoside triphosphate (i.e., dl^ITP , dNTP ) for chain elongation 

alone or in conjunction with the homologous equally mass-modified dideoxynucleoside 

triphosphate. For the three clones depicted above, the contents of the reaction mixtures 

can be as follows: either UP^/dNTP^/ddNFP^, UP^/dNTP ^/ddNTP^ and 

UP°/dNTP^/ddNTP^ or JUP^/dNTP^/ddNTP^, UP^/dNTp'/ddNTP^ and 
0 2 2 

UP /dNIP /ddNTP . As described above, DNA sequencing can be performed by 
Sanger RNA sequencing employing unmodified nucleic acid primers, UP*^, and an 
appropriate mhcture of chain-elongating and terminating nucleoside triphosphates. The 
niass-modification can be again either m the chain-terminating nucleoside triphosphate 
alone or in conjunction with mass-modified chain-elongating nucleoside triphosphates. 
Multiplexing is achieved by pooling the three base-specifically tenninated sequencing 
reactions (e.g., the ddTTP terminated products) and simultaneously analyzing the pooled 
products by mass spectrometry. Again, the first extension products of the known nucleic 
add primer sequence are assigned, e.g., via a computer program. Mass/sequence 
assignments are possible even in the worst case in which the nucleic acid primer is 
extended/terminated by the same nudeotide, e.g., ddT, in all three dones. The following 
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configurations thus obtained can be well diifferentiated by thdr different mass* 
moaifications: UP°-ddT^,UP^^dT\uP^-ddT^. - 

In yet another embodiment of this invention, DNA sequencing by multiplex 
mass spectrometry can be achieved by cloning the DNA fragments to be sekjuenced in 
S *'pIex-veaors" containir^ vector specific "tag sequences" as described (Koster et al , 
"Oligonucleotide Synthesis and Multiplex DNA Sequendng Using Chmiluminescent 
iPetection/ Nuclric Acids RB- Symposhuh Ser. No. 24, 318021 (1991)); then pooling 
dones from dififerent plex-vectors for DN A preparatioh and the four separate Sanger 
sequencing reactions using standard dNTP^/ddNTP^ and nucleic acid primer UP^; 

10 purifying the four multiplex fragment families via linking to a solid support through the 
linking group, at the 5*-^d of UP; washing om aU by-product^ 
purified multiplex DNA fragments off the support or using the L-L* bound nested Sanger 
fiagmc^s as such for mass spectrometric analysis as described above; performing de- 
multiplexing by one^by-ohe hybridization of specific "tag probes"; and subsequently 

15 analyzing by mass spectrometry (see, for example, FIGURE 1 3 ). As a reference point, 
the four base-spedficaliy terminated multiplex DNA fragment families are run by the 
mass spectrometer and all ^dT^-, dd A^-, ddC^- and ddG^-tcrminated molecular ion 
peaks are respectively detected and memorized Assignment of, for example^ ddT - 
terminated DNA fragments to a specific fragment family is accomplished by another mass 

20 spectrometric analysis after hybridization of the specific tag probe (TP) to the 

corresponding tag sequence contained in the sequence of this specific fragment family. 
Oiily those molecular ion peaks which are ci^>able of hybridizing to the specific tag probe 
are shifted to a higher molecular mass by the same known mass increment (e.g. of the tag 
probe). These shifted ibn peaks, by virtue of all hybridizing to a ^edfiic tag probe, 

25 belong to the same firagment femily. For a gK^en fragment fiunily, this is repeated for the 
remaining chain terminated fragment families with the same tag probe to assign the 
complete DNA sequence. This process is repeated i-1 times corresponding to i clones 
multiplexed (the 

i-th clone is identified by default). 
30 The differentiation of the tag probes for the different muhiplexed clones can 

be obtained just by the DNA sequence and its ability to Watsoh-Crick base pair to the tag 
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sequence. It is well known in the art how to calculate stringency conditions to provide 
for specific hybridization of a given tag probe with a given tag sequence (see» for 
example. Molecular Cloniny: A lahfiratorv nmniml 2ed, ed. by Sambrook, Fritsch and 
Maniatjs (Cold Spring Harbor Laboratoiy Press: NY, 1989, Chapter 1 1). Furthermore, 
dififerentiation can be obtained by designing the tag sequence for each plex-vector to have 
a sufficient mass difference so as to be unique just by changing, the length or base 
- xdinposition or by mass-modifications according to FIGURES 7, 9 and 10. In order to 
keep the duplex between the tag sequence and the tag probe intact during mass 
spectrometric analysis, it is another embodiment of the invention to provide for a covalent 
attachment mediated by, for example, photoreacdve groups such as psoralen and 
ellipticine and by other methods known to those skilled in the an (see, for example, 
Helene e/a/.,HalUEC 358 (1990) and Thuong et ai "Oligonucleotides Attached to 
Intcrcalators, Miotoreactive and Cleavage Agents" in F Eckstein, Oliyonudenririft^t 
Analogues A Practical Apprnarh IRL Press, Oxford 1991, 283-306). 

The DNA sequence is unraveled again by searching for the lowest 
molecular weight molecular ion peak corresponding to the known UP^-lag sequence/tag 
probe molecular weight plus the first extension product, e.g.. ddT° then the second, the 
third, etc. 

In a combination of the latter approach with the previously described 

multiplexing processes, a further increase in multiplexing can be achieved by using, in 

addition to the tag probe/tag sequence intmction, mass-modified nucldc acid primers 

(FIGURE 7) and/or mass-modified deoxynucleoade, dNTP°*'' and/or dideoxynucleoside 

_ ^ 0-i 
tnphosphates, ddNTP . Those skilled in the art will realize that the tag sequence/tag 

probe multiplexing approadi is not limited to Sanger DNA sequencmg generating nested 

DNA fragments with DNA polymerases. The DNA sequence can also be determined by 

transcribing the unknown DNA sequence fi-om appropriate promoter-containing vectors 

(see above) with various RNA polymerases and mixtures of NTP°"'/3 -dNTP^*', thus 

generating nested RNA fhgments. 

In yet another embodiment of this invention, the mass-modifying 

functionality can be introduced by a two or multiple step process. In this case, the 

nucleic acid primer, the chain-elongating or terminating nucleoside triphosphates and/or 
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the tag probes are, in a first step, modified by a precursor functionality such as azido, - 
N3, or modified with a functional group in which the R in XR is H (FIGURE 7, 9) thus 
providing temporary fimctions, e.g:, but not Ihnited to -OH, -NH2, -NHR, -SH, -NCS, 
.DCO(CH2)rCOOH (r = 1^20). -NHCO(CH2)rCOOH (r = 1-20), -OSOjOH, 
S -OCO(CH2)|1 (r = 1-20), ^P(0-Alkyl)N(Ailcyl)2. These less bulky functionalities result 
in better substrate properties for the enzymatic DNA or RNA synthesis reactions of the 
DNA sequencing process. The appropriate mass^modtfying functionality is then 
introduced after the generation of the nested base-specifically terminated DNA or RNA 
fragments prior to hiass speorometry. Several example^ of compounds which can serve 

10 as mass-modifyiiig fiinctionafities are depicted in FIGURES 9 and 10 without limiting the 
scope of tins in vemibn. 

Another aspect of this invention concerns kits for sequencing nucldc acids 
by mass spectrometiry which include combinations of the above-described sequencing 
reactants. For instance, in one embodiment, the kit comprises reactants for multiplex 

15 mass spectrometric sequencing of several different species of nucleic acid. The kit can 
include a solid support having a linking functionality (lS for immobilization of the base- 
specifically terminated products; at least one nucleic acid primer having a linking group 
(L) for reversibly and temporarily linking the primer and solid support through, for 
example, a photocleavable bond; a set of chain-elongating nucleotides (e.g., dATP, 

20 dCtP, dGTP and dTTP, or ATP, CTP, GTP and UTP); a set of chain-terminating 

nucleotides (such as 2\3 •didebxynucleotides for DNA synthesis or 3 -deoxyiiucleotides 
for RNA synthesis); and an appropriate polymerase for synthesizing complementary 
liiicleotides. Primers and/or terminating nucleotides can be mass-modified so that the 
base-specifically tenninated fitigments genmted from one of the ^ecies of nucldc adds 

25 to be sequenccid can be distinguished by mass spectrometry from all of the others. 
Alternative to the use of mass-modified synthesis reactants, a set of tag probes (as 
described above) can be included in the kit. The kit caii also indude appropriate buffers 
as well as instructions for performing multiplex mass spectrometry to concurrently 
sequence multiple spedes of nucleic adds. 

30 In another embodiment, a nucldc acid sequendng kit can comprise a solid 

support as described above, a primer for initiating synthesis of complementary nucldc 
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add fragments, a set of chain-elongating nucleotides and an appropriate polymerase. The 
mass-modified, chain-terminating nucleotides are selected so that the addition of one of 
the chain tmninators to a growing complementary nucleic add can be distinguished by 
mass spectrometry. 

The present invention is further illustrated by the following examples which 
should not be construed as limiting in any way. The contents of all cited references 
' '(including literature references, issued patents, published patent applications (induding 
international patent application Publication Number WO 94/16101, entitled "DNA 
Sequencing by Mass Spectrometry" by Kqesten.and international patent application 
Publication Number WO 94/21822 entitled "DNA Sequencing by Mass Spectrometry Via 
Exonuclease Degradation** by H. Koester), and co-pending patent applications, (including 
U.S Patent Application Serial No. 08/406,199, entitled "DNA Diagnostics Based on 
Mass Spectrometry" by H. Koester), as cited throughout this application are hereby 
expressly incorporated by reference. 

EXAMPLE 1 

Immobilization of primer-extension products of Sanger DNA sequencing reaction 
for mass spectrpmetric analysb via disalfide bonds. 

As a solid support, Sequelon membranes (Millipore Corp., Bedford. MA) 
with phenyl isothiocyanate groups are used as a starting mat^al. The mmbrane disks, 
with a dianietqr of 8 mm, are wetted with a sohition of N-methylmorpholine/water/2- 
propanol (NMM solution) (2/49/49 v/v/v), the excess liquid removed with filter paper 
and placed on a piece of plastic film or ahmunum foil located on a heating block set to 
55 C. A sQhition of 1 mM 2^meraq>toethyhmime (cysteamine) or 2, 2 -dithio^ 
bis(ethylamine) (cystamine) or S-(2-thiopyridyI)-2-tiiio.eUiylamine (10 ul, 10 nmol) in 
NMM is added per disk and heated at 55^C. After 15 nun, 10 ul of NMM solution are 
added per disk and heated for another 5 min. Excess of isothiocyanate groups may be 
removed by treatment with 10 ul of a 10 mM solution of glycine in NMM solution. For 
cystamine, the disks are treated with 10 ul of a solution of IM aqueous dithiothreitol 
(DTT)/2-propanol (1:1 v/v) for 1 5 min at room temperature. Then, the disks are 
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thoroughly washed in k filtration manifold with 5 aliquots of 1 ml each of the NMM 
solution, then with 5 aliquots of 1 ml acetonitrile/water (l/I v/v) and subsequently dried. 
If not used immediately the disks are stored whh free thiol groups in a solution of IM 
aqueous dithiothreitol/2-propanol (1:1 v/v) and, before use, DTT is removed by three 
washii^ with 1 ml eadi of the NMM solution. The primer oligonucleotides with 5*SH 
functionality can be prepaired by various m^ods (e.g., B.C.F Cfau et dl^ Ijuddc 
AiadtRffiLH, 5591-5603 (1986), Sprnat ^/^.. Nudric Acids Res IS. 4837-48 (1987) 
and Qligoinicleotides and Analogues: A Practical Approach ( F. Eckstein, editor), KL 
Press Oxford, 1991). Sequencing reactions according to the Sanger protocol are 
performed in a standard way (e.g., H. Swerdldw et ai, Mucleic Adds Res. 18. 1415-19 
(1990)). In the presence of about 7«10 mM DTT the free 5 -thiol primer can be used; in 
other cases, the SH functionality can be protected, e.g., by a trityl group duriiig the 
Sanger sequencing reactions and removed prior to anchoring to the support in the 
following way. The four sequencing reactions (1 50 ul each in an Eppendorf tube) are 
tenninated by a 10 min incubation at 70^C to denature the DNA polymerase (such as 
Klenow fragment, Sequenase) and the reaction mixtures are ethanol precipitated. The 
supematants are removed and the pellets vortexed with 25 ul of an IM aqueous silver 
nitrate solution, and after one hour at room temperature, 50 ul of an 1 M aqueous 
solution of DTT is added and mixed by vortexing. After 1 5 min, the mixtures are 
centrifiiged and the pellets aris washed twice with 100 ul ethylacetate by vortexing and 
centrifugation to remove excess DTT. The primer extension products with free 5 -thiol 
group are now coupled to the thiolated membrane supports under mild oxidizing 
conditions. In general, it is suffident to add the 5 -thiolated primer extension products 
dissolved in 10 ul 10 mM de-aerated triethylammonium acetate buffer (TEAA) pH 7.2 to 
the thiolated membrane supports. Coupling is achieved by drying the samples onto the 
membrane disks with a cold fan. This process can be repeated by wetting the membrane 
with 10 ul of 10 mM TEAA buffer pH 7.2 and drying as before. When using the 2- 
thiopyridyl derivatiz»i compounds, anchoring can be monitored by the release of 
pyridine-2-thione spectrophotometrically at 343 nm. 

In another variation of this approach, the oligonucleotide primer is 
fanctionalized wth an amino group at the 5'-end which is introduced by standard 
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procedures during automated DNA synthesis. After primer extension, during the Sanger 
sequencing process, the primary amino group is reacted with 3-(2-pyridyldithio) 
propionic acid M-bydro}^succinimide ester (SPDP) and subsequently coupled to the 
thiolated supports and monitored by the release of pyridyI-2-thione as described above. 
After denaturation of DNA polymerase and etbanol precipitation of the sequencing 
products, the supernatants are removed and the pell^ dissolved in 10 ul 10 mM TEAA 
buffer pH 7.2 and 10 ul of a 2 mM solution of SPDP in 10 mM TEAA are added. The 
reaction mixture is vortexed and incubated for 30 min at 25?C Excess SPDP is then 
removed by three extractions (vort^g, centrifiigation) with 50 ul each of ethanol and 
the resulting pellets are dissolved in 10 uI lO mM TEAA buffer pH 7.2 and coupled to 
the tluolated supports (see above). 

The primer-extension products are purified by washing the membrane disks 
three times each with 100 ul NMM solution and three times with 100 ul each of 10 mM 
TEAA buffer pH 7.2. The purified primer-extension products are released by three 
successive treatments with 10 ul of 10 mM 2-mercaptoethanol in 10 mM TEAA buffer 
pH 7.2, lyophilized and analyzed by either ES or MALDI mass spectrometry. 

This procedure can also be used for the mass-modified nucleic acid primers 
UP in an analogous and appropriate way, taking into account the chemical properties 
of the mass-modifying functionalities. 

EXAMPLE! 

Immobilizalioii of primer-atension products of Sanger DNA sequencing raiction 
for mass spectrometric analysis via the ievulinyl group 

S-Aminolevulinic acid is protected at the primary amino group with the 
Fmoc group using 9-fluorenyhnethyl N-sucdnimidyl carbonate and is then transformed 
into the N-hydroxysuccinimide ester (NHS ester) using N-hydroxysuccinimide and 
dicyclohexyi carbodiimide under standard conditions. For the Sanger sequencing 
reactions, nucleic acid primers. UP^'\ are used which are fiinctionalized with a primary 
amino group at the 5'-end introduced by standard procedures during automated DNA 
synthesis with aminolinker phosphoamidites as the final synthetic step. Sanger 
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sequencing is performed under standard conditions (see above). The four reaction 
mixtures ( 1 50 ul each in an Eppendorf tube) are heated to 70^C for 10 min-to inactivate 
the DNA polymerase, ethanol precipitated, centrifuged and resuspended in 10 ul of 10 
mM TEAA buflFer pH 7.2. 10 ul of a 2 mM solution of the Fmoc-5-aminoIevulinyI-NHS 
5 ester in 10 mM TEAA buffer is added, vortexed and incubated at 25^C for 30 min. The 
excess of the reagent is removed by ethahol precipitation and cratrifiigation. The Fmoc 
group is cleaved off by resuspending the peUets in 10 ul of a solution of 20% piperidine in 
KN.diniethylfoniianude/water (l:l v/v). After 15 min at 25^C, piperidine is thoroughly 
removed by three precipitations/centrifugations with 100 ul each of ethaiiol, the pielUets 

10 are resuspended in 10 ul of a solution of N-methybnbrpholihe, 2^propanol and water 

(2/10/88 v/v/v) and are coupled to the solid support canying an isothiocyanate group. In 
the case of the DITC-Sequelon membrane (Millipoiie Corp., Bedford, MA), the 
membranes are prepared as desmbed in EXAMPLE 1 and coupling is achieved on a 
heating block at 55^C as described above. KNA extension products are inunobilized in 

1 5 an analogous way. The procedure can be applied to other solid supports with 
isothiocyanate groups in a ^milar manner. 

The immobilized primer-extension products are extensively washed three 
times with 100 ul each of NMM solution and three times with 100 ul 10 mM TEAA 
buffer pH 7.2. The purified primer-extension products are released by three successive 

20 treatments with 10 ul of 100 mM hydrazinium acetate buffer pH 6.5, lyophilized and 
analyzed by either ES or MALDI mass spectrometry. 

EXAMPLE 3 

25 Immobilization of primer^tension products of Sanger DNA sequencing reaction 
for mass spectrometric analysis via a ttypsin sensitive linkage 

Sequelbn DITC membrane disks of 8 mm diameter (Millipore Corp., 
Bedford, MA) are wetted with 10 ul of NMM solution (N-methylmorpholine/propanaol- 
2/water; 2/49/49 v/v/y) and a linker arm imroduced by reaction with 10 ul of a 10 mM 
30 solution of 1 ,6-diaminohexane in NMM. The excess diamine is removed by three 

washing steps with 100 ul of NMM solution. Using stahdard peptide synthesis protocols. 
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two L-lysine residues are attached by two successive condensations with N-Fmoc-N- 
tBoc-L-lysine pentafluorophenylester, the temnnal Fmoc group is removed with 
piperidnie in NMM and the free a-amino group coupled to 1,4-phenylene 
, diisothiocyanate (DITC). Excess DITC is removed by three washmg steps with 100 ul 2- 
propanol and the N-tBoc groups removed with tiifluoroacetic acid according to standard 
. peptide synthesis procedures. The nucleic add primer-extension products are prepared 
- *0m oligonucleotides which carry a primary amino group at the 5'-teraunus. The four 
Sanger DNA sequencing reaction nnxtures (150 ul each in Eppendorf tubes) are heated 
for 10 min at 70 C to inactivate the DNA po^erase, ethanol precipitated, and the 
peDets resuspended in 10 ul of a solution of N-methylmorpholinc, 2-propanoi and water 
(2/10/88 v/v/v). This sohition is transferred to the Lys-Lys-DITC membrane disks and 
coupled on a heating block set at 55°C. After drying. 10 ul of NMM solution is added 
and the drying process repeated. 

The immobilized primer-extension products are extensively washed three 
times with 100 ul each of NMM solution and three times with 100 ul each of 10 mM 
TEAA buffer pH 7.2. For mass spectrometric analysis, the bond between the primer- 
extension products and the solid support is cleaved by treatment with trypsin under 
standard conditions and the released products analyzed by either ES or MALDI mass 
spectrometry with tiypsin serving as an imemal mass standard. 

EXAMPrK4 

ImmobiUzaUon ofprinier-cxteilsioii products of Sanger DNA sequencing reaction 
for mass spectrom^c mtkiy^ via pyrophosphate linkage 

The DITC Sequdon mnnbrane (disks of 8 mm diameter) are prepared as 
described in EXAMPLE 3 and 10 ul of a 10 mM sohition of 3-aininopyridine adenine 
dinudeotide (APAD) (Sigma) in NMM sohition added. The excess APAD is removed by 
a 1 0 ul wash of NMM solution and the disks are treated with 1 0 ul of 1 0 inM sodium 
periodate in NMM solution (15 min, 25®C). Excess periodate is removed and the 
primer-extension products of the four Sanger DNA sequencing reactions (1 50 ul each in 
Eppendorf tubes) employing nucleic add primers with a primary amino group at the 5*- 
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end are ethanol precipitated, dissolved in 10 ul of a solution of N-methylmorphoiine/2- 
propanol/water (2/10/8S v/v/v) and coupled to the 2* 3 -dialdehydo groups^f the 
immobilized NAD analog. 

The primer-extension products are extensively washed with the NMM 
5 solution (3 times with 100 ul each) and 10 mM TEAA buffer pH 7.2 (3 times with 100 ui 
each) and the purified primer-extension products are released by treatment with either 
NADase or pyrophosphatase in 10 mM TEAA buffer at pH 7.2 at 37^C for 15 nun, 
iyophilized aiid analyzed by either ES or MALDI mass spectrometry, the enzymes; serving 
as nitemal mass stamfairds. 

10 

Synthesis of nucleic acid primers mass-modified by glycine residues at the 5*- 

positidn of the siig^r inoiety of the terminal nncleosid(e 

IS Oligonucleotides are synthesized by istandard automated DN A s^ 

using B-cyahbethylphbsphoamidites (H. Koster et al^ Nucleic Acids Res i2, 4539 

(1984)) and a S*-amin6 group is introduced at the rad of solid ph^e DN A synthesis (e.g. 

Agrawal et aL, Nuckig Acids Res, 14, 6227-45 (1986) or Sproat et fl/., Nyclgi<? Agids 

Res. 15, 6 1 8 1 -96 (1987)). The total amount of an oligonucleotide synthesis, starting 

20 with 0.25 umol CPG-bound nucleoside, is deprotected with concentrated aqueous 

TM 

ammonia, purified via OligoPAK Cartridges (Millipore Corp., Bedford, MA) and 
Iyophilized. This material with a 5 -terminal amino group is dissolved in 100 ul absolute 
N^N-dimetlqrlformatnide (DMF) and condensed with 10 pinole N-Fmoc-glycine 
pentafluorophenyl ester for 60 min at 25^C. After ethaifiol predpitatioii and 

25 ' d^trifiigatioh, the Fnioc grbup is cleaved off by a 10 ihin treatment with 1 00 ul of a 

solution of 20% piperidiAe in N,N-dimethylfonhamide. Excess piperidine, DMF and the 
cleavage product firoih the Fnioc group ai^e removed by ethanol precipitation and the 
precipitiate Iyophilized fh)m 10 mM TEAA buffer pH 7.2 This material is now either 
used as prim^* for the Sanger DN A sequencing reactions or one or more {glycine residues 

30 (or other suitable protected amino acid active esters) are added to create a series of mass- 
modified primer oligonucleotides suitable for Sanger DNA or RNA sequencing. 
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Immobilization of these mass-modified nucleic acid primers UP^ ^ after primer-e>ctension 
during the sequencmg process can be achieved as described, e.g., in EXAMPLES 1 to 4. 

5 

Synthesis of nucleic acid primers mass-modified at C-5 of the heterocyclic base of a 
'pynmidine nudeoside with glydne residues 

Starting material was S-(3-aniinopropynyl-l )-3' S'-di^tolyldeoxyuridine 
prepared and 3' 5 -de-O-acylated according to literature procedures (Haralambidis et al. , 

10 Nucleic Adda Res IS 4857-76 (1987)). 0.281 g (1.0 mmole) S.(3-am2nopropyny|.l)-2'- 
deoxyuridine were reacted Moth 0.927 g (2.0 mmole) N-Fmoc-glycine 
pentafluorophenylester in S ml absolute N^N-dimethylformamide in the presence of 0. 129 
g (1 mmole; 174 ul) N^N-diisopropylethylaniine for 60 min at room temperature. 
Solvents were removed by rotaiy evaporation and the product was purified by silica gel 

15 chromatography (Kieselgel 60, Merck; column: 2.5x 50 cm, ehition with 

chloroform/methanol mbctures). Yield was 0.44 g (0.78 mmole, 78 %). In order to add 
another giydne residue, the Fmoc group is removed with a 20 min treatment with 20% 
solution of piperidine in DMF, evaporated in vacuo and the remaining solid material 
extracted three times with 20 ml ethylacetate. After having removed the remaining 

20 ethylacetate, N-Fmoc-glydne pentafluorophenylester is coupled as described above. 5- 
(3-(N-Fmoc-glycy])-amidopropynyl-l)-2'-deoxyuridine is transformed imo the 5'-0- 
dimethoxytritylated nucleoside-3 -0-B-cyanoethy]-N,N-diisoprop^phosphoaniidite and 
incorporated imo automated oligonucleotide synthesis by standard procedures (H. Kdster 
etal. Nucleic Acids Res. 12, 2261 (1984)). This glycine modified thymidine analogue 

25 building block for chemical DNA synthesis can be used to substitute one or more of the 
thymidineAiridine nucleotides in the nudeic acid primer sequrace. The Fmoc group is 
removed at the end of the solid phase synthesis with a 20 min treatment with a 20 % 
solution of piperidine in DMF at room temperature. DMF is removed by a washing step 
with acetonitrile and the oligonudeotide deprotected arid purified in the standard way. 

30 



EXAMPLE 7 
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Synthesis of a niicleic acid primer mass-modified at C-S of tiie beteroeyclic base of 
a pyrimidine nucleoside with D-alanine residues 

Starting material was the same as in EXAMPLE 6. 0.281 g (1 .0 mmole) 
5-(3-Aminopropynyl-l>2'-deoxyuridine was reacted with N*Fmoc-B-alanine 
pentafluorbphenylester (0.955 g. 2.0 mmole) in 5 ml N,N-dimethylfonnamide (DMF) in 
the presence of 0.129 g (174 ul; 1.0 mmole) N,N-disopr6pyleihylamine for 60 min at 
room temperature. Solvents were removed and the product purified by silica gd 
chromatography as described in EXAMPLE 6. Yield was 0:425 g (0.74 mmole, 74 %). 
Another B-alanine moiety can be added in exactly the same way after removal of the 
Fmoc group. The preparation of the 5'-0-dimethoxytritylated nucieoside*3'-O-0- 
cyarioethyl-N.N-diisopropylphosphoamidite from 5-(3-(N-Fmoc-BralanyI). 
amidopropynyM)^2^eoxyuridine and incorporation into automated oligonucleotide 
synthesis is performed under standard conditions. This building block can substitute for 
any of the thymidine/uridine reddues in the nucleic acid primer sequence. In the case of 
only one incorporated mass-modified nucleotide, the nucleic acid primer molecules 
prepared according to EXAMPLES 6 and 7 would have a mass difference of 14 daltons. 



wo 97/17041 



PCTAJS97/D43M 



.43. 

EXAMPLES 

Synthesis of a nudeic acid primer mass-modified at C-5 of the heterocyclic base of 
a pyrimidine nucleoside with ethylene glycol monomethyl ether 

As a nucleosidic component, S-(3-aminopropynyM)-2'-deoxyuridine was 
used in this example (see EXAMPLES 6 and 7). The mass-modifying functionality was 
bEtained as follows: 7.61 g (100.0 mmole) freshly distilled ethylene glycol monomethyl 
ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100.0 mmole) 
recrystaUized succinic anhydride in the presence of 1 .22 g (10.0 nrniole) 4-N,N- 
dimethylaminopyridine overnight at room temperature. The reaction was terminated by 
the addition of water (5.0 ml), the reaction mixture evaporated in vacuo, co-evaporated 
twice with dry toluene (20 ml each) and the residue redissolved in 100 ml 
dichloromethane. The solution was extracted successively, twice with 10 % aqueous 
citric acid (2 x 20 ml) and once with water (20 ml) and the organic phase dried over 
anhydrous sodium sulfate. The organic phase was evaporated in vacuo, the residue 
redissolved in 50 ml dichloromethane and precipitated into 500 ml pentane and the 
predpitate dried in vacuo. Yield was 13. 12 g (74,0 nunole; 74 %). 8.86 g (50.0 mmole) 
of succinylated ethylene glycol monomethyl ether was dissolved in 100 ml dioxane 
containing 5% dry pyridine (5 ml) and 6.96 g (50.0 mmole) 4-nitrophenol and 10.32 g 
(50.0 mmole) dicydohexylcarbodiimide was added and the reaction run at room 
temperature for 4 hours. Dicyclohexylurea was removed by filtration, the filtrate 
evaporated in vacuo and the residue redissolved in 50 ml anhydrous DMF. 12.5 ml 
(about 12.5 mmole 4-nitrophenylester) of this solution was used to dissolve 2.81 g (10.0 
mmole) 5-(3-aminopropynyM)-2'-deoxyuridine. The reaction was performed in the 
presence of l.OI g (lO.O mmole; 1.4 mi) triethylamine at room temperature overnight. 
The reaction mixture was evaporated in vacuo^ co-evaporated with toluene, redissolved 
in dichloromethane and chromatographed on silicagel (Si60, Merck; column 4x50 cm) 
with dichloromethane/methanol mixtures. The fiiactions containing the desired compound 
were collected, evaporated, redissolved in 25 ml dichloromethane and precipitated into 
250 ml pentane. The dried precipitate of 5-(3-N-(0-succinyl ethylene glycol monomethyl 
ether)-amidopropynyH)-2'-deoxyuridine (yield: 65 %) is 5'-0-dimethoxytritylated and 
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transformed into the nucIeoside-3 -0-B-cyanoethyl-N, N-diisopropylphosphoamidite and 
incorporated as a building block in the automated oligonucleotide synthesis^ according to 
standard procedures. The mass-modified nucleotide can substitute for one or more of the 
thymidine/uridine residues in the nucleic acid primer sequence. Deprotection and 
purification of the primer oligonucleotide alsd follows standard procedures. 
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EXAMFLEy 

Synthesis of a nudeic acid primer mass-modified at C*5 of the heterocyclic base of 
a pyrimidine nucleoside with diethylene glycol monometbyl ether 

Nucleosidic starting material was as in previous examples, S-(3- 
aminopropynyl- 1)-2 -deoxyuridine. The mass-modiiying functionality was obtained 
aiSilar to EXAMPLE 8. 12.02 g (100.0 nunoie) freshly distiUed diethylene glycol 
monomethyl ether dissolved in SO ml absolute pyridine was reacted with 10.01 g (lOO.O 
mmole) recrystallized succinic anhydride in the presence of 1.22 g (10,0 mmole) N- 
dimethylaminopyridine (DMAP) overnight at room temperature. The work-up was as 
described in EXAMPLE 8. Yield was 18.35 g (82.3 mmole, 82.3 %). 1 1.06 g (50.0 
mmole) of succinylated diethylene glycol monomethji ether was transformed into the 4- 
nitrophenylester and, subsequently, 12.5 mmole was reacted with 2.81 g (10.0 mmole) of 
5.(3.aminopropynyl-l)-2'-deoxyuridine as described in EXAMPLE 8. Yield after silica 
gel column chromatography and precipitation imo pentane was 3.34 g (6.9 mmole, 69 
%). After dimethoxytritylation and transformation into the nucleoside-!)- 
cyano^hylphosphoamidite, the mass-modified building block is incorporated into 
automated chemical DNA synthesis according to standard procedures. Within the 
sequence of the nucleic acid primer UP^"', one or more of the thymidine/uridine residues 
can be substituted by this mass-modified nucleotide. In the case of only one incorporated 
mass-modified nucleotide, the nucleic acid primers of EXAMPLES 8 and 9 would have a 
mass difference of 44.05 daltons. 

EXAMPLE IQ 

Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyclic base of 
deoxyadenosine with glycine 

Starting material was N^-ben2oyI-8-bromo-5*-0-(4,4'-dimethoxytrityl)-2*- 
deoxyadenosine prepared according to literature (Singh et al. Nucleic Adds R^ f^ 
3339-45 (1990)). 632.5 mg (1.0 mmole) of this 8-bromo-deoxyadenosine derivative was 
suspended in 5 ml absolute ethanol and reacted with 251 .2 mg (2.0 mmole) glycine 
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methyl ester (hydrochloride) in the presence of 241,4 mg (2.1 mmole; 366 ul) N, N- 
diisopropylethylamine and refluxed until the starting nucleosidic material had disappeared 
(4-6 hours) as checked by thin layer chromatography (TLC). The solvent was 
evaporated and the residue purified by silica gel chromatography (column 2.5x50 cm) 
S using solvent mixtures of chlorofonn/methanol containing 0. 1 % pyridine. The product 
fractions were combined, the solvent esvaporated, the fractions dissolved in 5 ml 
dichlcromethane and precipitated into 100 ml pentane. Yidd was 487 mg (0.76 mmole, 
76%). Transformation into the corresponding nuclra^de-B-^anoethyl^^ 
and integration into automated chemical DNA synthesis is performed under staxidard 
1 0 conditions. During final deprotection with aqueous concentrated ammonia; the methyl 
group is removed fi'om the glycine moiety. The mass-modified building block can 
substitute one or more deoxyadenosine/adenosine residues in the nucldc acid primer 
sequence. 

15 EXAMTLEU 

Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyclic base of 
dcoxyadenosine with glycylgiydne 

This derivative was prepared in analogy to the glydne derivative of 
20 EXAMPLE 10. 632.5 mg (1 .0 mmole) N^-Ben2oyl-8.bromo-5»-0-(4,4'- 

dimethoxytrityt)-2-deoxyadenosine was suspended in 5 ml absolute ethanol and reacted 
with 324.3 mg (2.0 mmole) glycyl-glydne methyl ester in the presence of 241.4 mg (2.1 
mmole, 366 pi) 

K N-diisopropylethylamine. The mixture was refluxed and completeness of the reaction 
25 checked by TLC. Work-up and purification was similar to that described in EXAMPLE 
10: Yield after silica gel column chromat6gn4>hy and precipitation into pentane was 464 
nig (0.65 mniole, 65 %). Transformation into the nucleoside-B- 
cyanoethylphosphoaitnidite and into synthetic oligonucleotides is done according to 
standard procedures. In the case where only one of the deoxyadenoane/adeho«ne 
30 residues in the nuddc add primer is substituted by this mass-modified nucleotide, the 
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mass difference between ihe nucleic add primers of EXAMPLES 10 and 1 1 is 57.03 dahons. 

EXAMPLE 12 

Synthesis of a nucleic acid primer mass-modified at the C-2* of the sugar moiety of 
2*-amino-2*-deozythymidine with ethylene glycol monomethyl etiier residues 
' Starring nfmtcrial was 5^0<4,4-dimcthoxytrityI)-2'-an^ 

deoxythymidine synthesized according to published procedures (e.g., Verheyden et al.^ JL 
Org. ChCTl> 2fi. 250-254 (1971); Sasaki etal, 1. Or g Chem 41, 3138-3143 (1976); 
Imazawa ei al, J. Org. Chcm ^ 2039-2041 (1979): Hobbs elaL. J. Org Chem Ai 
714-719 (1976); Ikchara et al, Chem Pharm Bull Japan ifi 240-244 (1978); see also 
PCX Application WO 88/00201). 5'T<M4.4-Dimethoxytrityl).2'-ainino.2'- 
dcoxythymidine (559.62 mg; 1.0 nunole) was reacted with 2.0 mmole of the 4- 
nitrophenyl ester of succinylated ethylene glycol monomethyl ether (see EXAMPLE 8) in 
10 ml dry DMF in the presence of 1 .0 mmole (140 \x\) triethylamine for 1 8 hours at room 
temperature. The reaction mixture was evaporated in vacuo, co-eyaporated with 
toluene, redissolved in dichloromethane and purified by silica gel chromatography (Si60, 
Merck; column: 2.5x50 cm, eluent: chloroform/methanol mixtures containing 0.1 % 
triethylamine). The product containing fractions were combined, evaporated and 
precipitated into pentane. Yield was 524 mg (0.73 mmol; 73 %). Trantfonration into 
the nucieoside-B-cyanoethyl-N,N-diisopropyIpho5phoamidite and incorporation into the 
automated chemical DNA synthesis protocol is performed by standard proc^ures.. The 
mass-modified deoxythymidine derivative can substitute for one or more of the thymidine 
residues in the nucleic add primer. 

In an analogous way. by employing the 4-nhrophenyl ester of succinylated 
diethylene glycol monomethyl ether (see EXAMPLE 9) and triethylene glycol 
monomethyl ether, the corresponding mass-modified oligonucleotides are prepared. In 
the case of only one incorporated mass-modified nucleoside within the sequence, the 
mass difference between the ethylene, diethylene and triethylene glycol derivatives is 
44.05, 88.1 and 132.15 daltons respeaively. 
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EXAMTLEl? 

Synthesis of a nucleic acid primer mass-modified in the internudeotidic linkage via 
alkylation of phosphorothioate groups 
S Phosphorothioate-containing oligonucleotides were prepared iaccording to 

standard procedures (see eg. Gait et ai:. Nucleic Acids Res 19 1 183 (1991)). One, 
several or all internucieotide linkages can be modified in this way: The (-)-M13 nucleic 
add primer sequence (17-mer) 5'-dGTAAAACGACGGGCAGt was synthe^ed in 0.25 
pmole scale on a DNA synthesizer and one phosphorotMoate group introduced after the 

10 final i^heisis cycle (G to T coupfing). Sulfiuization, deprotectton and purification 
followed standard protocols. Yidd was 3 1 .4 nmole ( 1 2.6 % overall >^eld), 
corresponding to 31.4 nmole phosphorothioate groups. Alkylation was performed by 
dissolving the residue in 3 1.4 ^1 TE buffer (0.01 M tris pH 8.0, 0.001 M EDTA) and by 
addiiig 16 pi of a sohidon of 20 mM sohition of 2-iod6ethanol (320 hihole; i.e.» lO-fold 

15 Access with respect to phosphorothioate diesters) in N,N-dini^hylfonnaniide (DMF). 
The alkylated oligonucleotide was purified by standard reversed phase HPLC (RP-18 
Uitraphere, Beckman; column: 4.5 x 250 mm; 100 mMtriethylammonium acetate* pH 7.0 
and a gradient of 5 to 40 % acetonitrile). 

In a variation of this procedure, the nucleic add primer containing one or 

20 more phosphorothioate phosphodiester bond is used in the Sanger sequendng reactions. 
The primer^xtehsion products of the four sequencing reactions are purified as 
exemplified in EXAMPLES 1 - 4, cleaved off the solid support, lyophilized and dissolved 
in 4 pi each of TE buffer pH 8.0 and alkylated by addition of 2 pi of a 20 mM solution of 
2*iodoethanol in DMF. It is then analyzed by ES and/or MALDI mass spectrometry. 

25 In an analogous way, onploying instead of 2-iodoethahol, e.g., 3- 

iodopropahol, 4-i6d6butahol mass-modiiSed nucleic acid primer are obtained with a mass 
(fiffereiice of 1403, 28.06 and 42.03 daltons respectively compared to the unmodified 
phosphorothioate phosphodiester-containing oligonucleotide. 
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Synthesis f 2*-ainino-2*-de x3ruridine-5*-triphosphate and 3*-aniino-2%3*- 
dideoxytbyinidine-5*-triphosphate mass-modified at the 2** or 3'-amino function 
with glycine or D-alanine residues 

Starting material was 2 -azido-2'*deoxyuridine prepared according to 
literature (Verheyden ei aL . J Org Chem 36^ 2S0 (1971)), which was 4,4> 
dtmethoxytntylated at 5'-OH with 4,4-diniethoxytrityl chloride in pyridine and acetylated 
at 3'-0H Mdth acedc anhydride in a one^wt reaction using standard reaction conditions. 
With 191 mg (0.71 mmole) 2 -azido-2'-deoxyuridine as. starting material, 396 mg (0.6S 
mmol, 90.8 %) S-CH4,4-dimethoxytrityl)-3 -0-acetyl-2-a2]do-2'-d^ 
obtained after purification via silica gel chromatography. Reduction of the azido group 
was performed using published conditions (Barta et al. Tetrahedron 46 . 587-594 
(1990)). Yield of 5*-0-(4.4-dimethoxytrityl)-3'-0-acetyl-2'.amino-2'-deoxyuridine after 
silica gel chromatography was 288 mg (0.49 mmole; 76 %). This protected 2-aminD-2'- 
deoxyuridine derivative (588 mg, l.Ommole) was reacted with 2 equivalents (927 mg, 
2.0 mmole) N-Fmoc-glycine pentafluorophenyl ester in 10 ml dry DMF overnight at 
room temperature in the presence of 1,0 mmole (174 pi) N,N-diisopropylethylamine. 
Solvents were removed by evaporation in vacuo and the residue purified by silica gel 
chromatography. Yield was 71 1 mg (0.71 mmole, 82 %). I>etritylation was achieved by 
a one hour treatment with 80% aqueous acetic add at room temperature. The residue 
was evaporated to dryness, co-evaporated twice with toluene, suspended in 1 ml dry 
acetonitrile and 5'-phosphoryiated with POCI3 according to literature (Yo^kawa et ai, 
Bltll, Chem. Sog, Japan 42, 3505 (1969) and Sowa ei ai. Bull Chem Soc Japan 
2084 (1975)) and directly transformed in a one-pot reaction to the 5Vtriphosphate using 3 
ini of a 0.5 M solution (l.S mmole) tetra (tri-n-butylammonium) pyrophosphate in DMF 
according to literature fe e. Seela et aL. Helvetica Chimica Acta 74 1048(1991)). The 
Fmoc and the 3 -0-acetyl groups were removed by a one-hour treatment with 
concentrated aqueous anunonia at room temperature and the reaction mixture evaporated 
and lyophilized. Purification also followed standard procedures by using anion-exchange 
chromatography on DEAE-Sephadex with a linear gradient of triethylammonium 
bicarbonate (0. 1 M - 1.0 M). Triphosphate containing fractions (checked by thin layer 
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chromatography on polyethyieneimine cellulose plates) were collected, evaporated and 
lyophilized. Yield (by UV-absorbance of the uracil moiety) was 68% (a.48-mmole). 

A glycyl-glycine modified 2 -amino-2*-deoxyuridine-5'-triphosphate was 
obtained by removing the Fmoc group from 5 -0-(4,4-dimethoxytrityl)-3 -O-acetyI-2 -N- 
5 (N-9-fluorenylmethyloxycarbonyUglycyl)-2 -amino-2'-deoxyuridine by a ohe^hour 

treatment with a 20% solution of piperidine in DMF at room tonperature, evaporation of 
Events, two*fold co-evaporation with toluene and subsequent condensation with N- 
Fmoc-glycine pentafluorophenyl ester. Starting with I.O mmole of the 2-N-glycyt-2'- 
amino-2'-deoxyuridine derivative and following the procedure described above, 0.72 

1 0 mmole (72%) of the corresponding 2'-<N-glycyl-gIycyl>-2'^amina-2'-deoxyuridine-S - 
triphosphate was obtained. 

Starting with 5*-CK4.4-dimethoxytrityl)-3'.O^accty|.2*-amino-2*- 
debxyuridme and coupling with N-Fmoc-B-alanirie pentafluorophenyl ester, the 
correspohding 2*-<N-fi-alanyl)-2'-^anuno-2-d can be 

15 synthesized. These modified nucleoside triphosphates are incorporated during the Sanger 
DNA sequencing process in the primer-extension products. The mass difference between 
the glycine, B-alanine and glycyl-glycine mass-modified nucleosides is, per nucleotide 
incorporated, 58,06, 72 09 and 1 15.1 daltons respectively. 

When starting with 5 •0-(4,4.dimethoxytrityl)-3*-amino-2',3 - 

20 didecxythymidineXobtained by published procedures, see EXANffLE 12), the 

corresponding 3 -(N-glycyO-B'-amino-/ 3 -(-N-glycyl-glycyl)-3-amino-/ and 3 -(N-B- 
alanyl)-3-amino-2*,3-dideoxythymidine-5 -triphosphates can be obtained: These mass- 
modified nucleoside triphosphates serve as a tenninating nucleotide unit in the Sanger 
DNA sequencing reactions providing a mass difference per terminated fragment of 58.06, 

25 72:09 and llS.l daltons respectively when used in the multiplexing sequencing mode. 
The mass-differentiated fragments can then be analyzed by ES and/or MALDI mass 
spectrometry. 
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Synthesis of deoxyundine-S'-triphosphate masMnodified at C-S of the heterocyclic 
base with glydnetgtycyl-glycine and D-alanine residues. - - 

0.281 g (1.0 nunole) 5-(3-AnunopFopyi^-l)-2'-deoxyundine (see 
EXAMPLE 6) was reacted with cither 0.927 g (2.0 nunole) N-Fmoc-glydne 
pentafluorophenylestN* or 0.9S5g (2.0 nunole) N-Fmoc-fl-alanine pentafluorophenyl ester 
in 5 ml dry DMF in the presence of 0.129 g N-diisopropylethylanune.(174 ul, 1.0 
iiinldle) overnight at room temperature. Solvents were removed by evaporation in vacuo 
and the condensation products purified by flash chromatography on silica gei (Still et ai, 
h Org. Chcm, 42, 2923-2925 (1978)). Yields were 476 mg (0,85 nunole: 85%) for the 
glycine and 436 mg (0.76 mmole; 76%) for the 0-alanine derivatives. For the synthesis of 
the glycyl-glycine derivative, the Fmoc group of 1.0 mmole Fmoc-glycine-deoxyuridine 
derivative was removed by one-hour treatment with 20% piperidine in DMF at room 
temperature. Solvents were ranoved by evaporation in vacuo, the residue was co- 
evaporated twice with toluene and condensed with 0,927 g (2.0 mmole) N-Fmoc-glycine 
pentafluorophenyl ester and purified as described above. Yield was 445 mg (0.72 nunole; 
72%). The glycyl-, glycyl-glycyl- and B.alanyl-2 -deoxyuridine derivatives, N-protected 
with the Fmoc group ^yere transformed to the 3 -O-acetyl derivatives by tritylation with 
4,4^dimethoxytrityl chloride in pyridine and acetylation with acetic anhydride in pyridine 
in a one-pot reaction and subsequendy detritylated by one hour treatment with 80% 
aqueous acetic acid according to standard procedures. Solvents were removed, the 
residues dissolved m 100 ml chloroform and e?ctracted twice with 50 ml 10% sodium 
bicarbonate and once with 50 ml y/at&, dried mth sodium sulfate, the solvent evaporated 
and the residues purified by flash chromatography on silica gel. Yields were 361 mg 
(0.60 mmole; 71%) for the glycyl-, 35 1 mg (0.57 mmole; 75%) for the B-alanyl- and 323 
mg (0.49 mmole; 68%) for the glycyl-glycyI.3^0'-acctyl-2*-deoxyuridine derivatives 
respectively. Phosphorylation at the 5'-0H with POCI3, transformation into the 5'- 
triphosphate by in-situ reaction with tetra(tri-n-butylammonmm) pyrophosphate in DMF, 
3-de-O-acetylation, cleavage of the Fmoc group, and final purification by anion-exchange 
chromatography on DEAE-Sephadex was performed as described in EXAMPLE 14. 
Yields according to UV-absorbance of the uracil moiety were 0.41 mmole 5-(3-(N- 
glycyI)-amidopropynyl-l)-2*-deoxyuridine-5'-triphosphate (84%), 0,43 mmole 5-(3-(N-B- 
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aianyl)-amidapropynyl-l)-2-deoxyuridine-S -triphosphate (75°^^ and 6.38 mmole 5-(3- 
{N-gIycyl-gly<yl)-ainidopropynyI-l )-2 -deoxyuridine-5'-tri^ 

These mass-modified micleoside triphosphates were incorporated during the 
Sanger DNA sequencing prim^-ext^sion reactions. 

Vfhen using 5-(3-aminopropynyt^l)-2^3'*^ideo9cyuridih^ as startihg mataial 
and foHowing aii analogous reaction sequence the corresponding gly^-, glycyl-glycyl- 
ahd fi^lanyl-2\3*-dideoxyuridine-5'*triphosphates were obtained in yiidds of 69, 63 and 
71% respectively. These mass^modified nucleoside triphosphates serve as duun- 
terminating nucleotides during the Sanger DNA sequencing reactions. The maiss- 
modified sequencing ladders are analyzed by either ES or MALDl mass spectrometry. 

Synthesis of S-glycyk and 8-gIycyI-glycyl-2*'-deoiyadenosihe-'5*-(riphdsphate 

6 

727 mg (1.0 mmole) of N -(4-tert-butyIphenoxyacetyl)-8-g!ycyl-5'-(4,4- 
dimethoxytrityl)-2 - deo)cyadenoane or 800 mg (1 .0 mmole) N^-(4-tcrt- 
butylphenoxyacetyl)-S-glycyl-glycyI-S-(4,4-dimethoxytrityl)-2'Hleoxyadenosine prepared 
according to EXAMPLES 10 and 1 1 and literature (KOster et aL, Tetrahedron 27, 362 
(1981)) were acetylated with acetic anhydride in pyridine at the 3 -OH, detritylated at the 
5 -position with 80% acetic acid in a one-pot reaction and transformed into the 5 - 
triphosphates via phosphorylation with POCI3 and reaction in-situ with tetra(tri-n- 
butylammonhim) pyrophosphate as described in EXAMPLE 14. Deprotection of the N^- 
tm4>utylphen6xyacetyK the 3 -0-acetyl and the O-methyl group at the glycine residues 
was achi^ed with concentrated aqueous ammonia Ibr ninety minutes at room 
temperature. Ammonia was removed by lyophilization and the residue washed with 
dichloromethane, solvent removed by evaporation in vacuo and the remaining solid 
materia] purified by anion-excha^ge chromatography on DEAE-Sephadex using a linear 
gradiem of trietfaylammonium bicarbonate from 0.1 to 1:0 M: The nucleoside 
triphosphate containing fi-actions (checked by TLC on poly^hyleneimine cellulose plates) 
were combined and lyophilUzed. Yield of the 8-giycyl-2'-deoxyadenoane^S**triphosphate 
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(dctermined by UV-absorbance of the adenine moiety) was 57% (0.57 mmole). The yield 
for the 8-glycyl-glycyl-2'-iieoxyadenosine-5 -triphosphate 51% (OrSI 4nmole). 

These mass-modified nucleoside tripho^hates were incorporated during 
primer-extension in the Sanger DNA sequencing reactions. 

When using the corresponding N6*(4-tert-butylphenoxyacetyl>8-glycyl- or 
-glycyI.glycyl-5*-0.(4,4.dimcthoxytrityI)-2\3'-dide^^ derivatives as starting 

- iiiaterials prepared according to standard procedures (see. e.g.. for the introduction of the 
2\3'-fiinction: Seda et aL Helvetica Thtmiea A^^ 74 1048-1058 (1991)) and using an 
analogous reaction sequence as described above, the chain-terminathig roass-mpdified 
mideoside triphosphates 8-gIycyl- and 8-gIycyI.glycy|.2'.3*-dideoxyadenosine-5'- 
triphosphates were obtained in 53 and 47% yields respectively. The mass-modified 
sequencing fragment ladders are analyzed by either ES or MALDI mass spectrometry. 

EXAMFLE 17 

Mass-modification of Sanger DNA sequencing fragment ladders by incorporation 
of chain-dongating 2'-dcoxy- and chain-terminating 2%3'-dideoxythymidine-5*- 
(alpha*S-)-triphosphate and subsequent allgrlation with 2-iodoethanol and 3- 
iodopropanol 

2',3 -Dideoxythymidine-5*-(alpha-S)-triphosphate was prepared according to 
pubMshed procedures (e.g., for the alpha-S-triphosphate moijsty: Eckstein et aL, 
B i ochem i afy ii, 1685 (1976) and Accounts Chem Jt^ A7 204 (1978) and for the 2',3'- 
dideoxv moietv: Seda et aL Helvetica Chimi«i Ana yd 1048-1058.(1991)). Sanger 
DNA sequencing reactions emptejrog 2'.deoxythymidine-5Hdpha-S>-tripho^ are 
performed according to standard protocols (e.g. Eckstdn, Ann Rev Binrhftm 54, 357 
(1985)). When using 2',3'-dideoxythymidine-5*.(alpha.S>.triphpsphates, this is used 
instead of the unmodified 2\3*-^ideoxythymidine-y.triphosphate in standard Sanger DNA 
sequencing (see e.g. Swerdlow et al. Nudeic Acid<s 1 R 1415-1419(1990)). The 
template (2 pmole) and the nucleic acid MI3 sequencing primer (4 pmole) modified 
according to EXAMPLE 1 are annealed by heating to 65^C in 100 ul of 1 0 mM Tri^HCI 
pH 7.5, 10 mM MgClj, 50 mM NaCl, 7 mM dithiothreitol (DTT) for 5 min and slowly 
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brought to 37^C during a one hour period. The sequencing reaction mixtures contain, as 
exemplified for the T-spedfic termination reaction, in a final volume of 150^1, 200 uM 
(final concentration) each of dATP, dCTP, dTTP, 300 liM c7-deaza-dGTP, 5 uM 2',3'- 
dideoxythymidine*5**(dIpha'^S)-triphosphate and 40 units Sequenase (United States 
5 Biochemicals). Polymerization is performed for 10 min at 37^C, the reaction mixture 

heated to 70^C to inactivate the Sequenase. ethanol precipitated and coupled to thiolated 
Sequelon membrane disks (8 mm diam^er) as described in EXAMPLE 1 . Alkylation is 
performed by treating the disks with 10 ut of 10 mM solution of either 2-i6dbethanol or 
3-iodoprbpanol in NMM (N-methybnorpholineAvater/2-propanol, 2/49/49; v/v/v) (three 
1 0 times)» washing with 1 0 ul NMM (three tinies) and cleaving the alkylated T-^terminated 
primer-extension products off the support by treatment with DTT as described in 
EXAMPLE 1 . Analysis of the mass-modified fragment fiimilies is performed with either 
ES or MALDI mass spectrometry. 

15 EXAMTLEI? 

Analysis of a Mixture of Oligothymidylic Acids 

Oligothymidylic add, oligo p(dT) j2-i 8> is commercially available (United 
States Biochemical, Cleveland, OH). Generally, a matrix solution of 0.5 M in ethanol 

20 was prepared. Various matrices were used for this Example and Examples 19-21 such 
as 3,5-dihydroxybenzoic acid, sinapinic add, 3-hydTOxypicolinic acid, 2,4,6- 
trihydroxyacetophenone. Oligonucleotides were lyophilized after purification by HPLC 
and taken up in ultrapurb water (MilliQ, Millipore) using amounts to obtain a 
concentration of 1 0 pmoles/fil as stock solution. An aliquot ( 1 id) of this concentration 

25 or a dilution in uhrapure water was mixed with 1 |il of the matrix solution on a flat metal 
surface serving as the probe tip and dried with a &n lisihg cold air. In some experiments, 
cation-ion exchange beads in the acid form were added to the mbtfure of matrix and 
sample solution. 

MAIJ)^T0F spectra were obtained for this Example and Examples 19-21 
30 on different commerdal instruments such as Vi»on 2000 (Finnigan-MAT), VG TofSpec 
(Fisons Instruments), LaserTec Research (Vestec). The conditions for this Example were 
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linear negative ion mode with an acceleration voltage of 25 kV. Th MALDI-TOF 

spectrum generated is shown in FIGURE 14. Mass calibration was done externally and 

generally achieved by using defined peptides of appropriate mass range such as insulin, 

gramicidin S. trypsinogen, bovine serum.albumen, and ^ochrome C. All spectra were 

generated by employing a nitrogen laser with 5 nsec pulses at a wavelength of 337 nm. 

6 7 2 

Laser energy varied between 10 and 10 W/cm . To improve signal-to-noise^ratio 
-generally, the intensities of 10 to 30 laser shots were accumulated. 
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EXAMPLE 19 

Mass Spectrometric Analysis of a SO-mer and a 99-nier 

Two large oligonucleotides were analyzed by mass spectrometry. The 50- 

' mcr 

d (TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT) 
(SEQ JD N0:3) and dT(pdT)99 were used. The oligodeoxymiclebtides were synthesized 
using P-<^anoethylphosphoanudites and purified using published procedures.(e.g. N.D. 
Sinha, J. Biemat, J. McManus and H. Kfister, NuclricAdds Res 12. 4539 (1984)) 
employing commercially available DNA synthesizers firom either MiOipore (Bedford, MA) 
or Applied Bio^grstems (Foster City, CA) and HPLC equipment and RP18 reverse phase 
columns from Waters (Milford, MA). The samples for mass spectrometric analysis were 
prepared as described in Example 18. The conditions used for MALDI-MS analysis of 
each oligonucleotide were 500 finol of eadi oligonucleotide, reflectron positive ion mode 
with an acceleration of 5 kV and postacceleration of 20 kV. The MALDI-TOF spectra 
generated were superimposed and are shown in FIGURE 1 S. 

EXAMPLE 20 

Simulation of the DNA Sequencing Results of FIGURE 2 

The 13 DNA sequences representing the nested dT-terminated fragments 
of the Sanger DNA sequencing for the 50-mer described in Example 19 (SEQ ID N0:3) 
were synthesized as described in Example 19. The samples were treated and 500 fmol of 
each fragment was analyzed by MALDI-MS as described in Example 18. The resuking 
MALDI-TOF spectra are shown in FIGURE 16. The conditions were reflectron positive 
ion mode with an acceleration of 5 kV and postacceleration of 20 kV, Calculated 
molecular masses and experimental molecular nuisses are shown in Table I . 

The MALDI-TOF spectra were superimposed (FIGURE 1 7) to 
demonstrate that the individual peaks are resolvable even between the lO-mer and 1 1-mer 
(upper panel) and the 37*mer and 38-mer (lower panel). The two panels show two 
different scales and the spectra analyzed at that scale. 
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EXAMPLE 21 

MALDI-MS Analysis of a Mass-Modffied Oligonudeotide 

A 1 T-mer was mass-modified at C-5 of one or two deoxyuridine moieties. 
5-[ 1 3.(2-Methoxyethoxyl)-tridecync-l •yl].5'.0.(4,4'^^ 

P^cyanbethyl-K N-diisoprop^phosphoamidite was used to synthesize the modified 17- 
mers using the methods described in Example 19. 
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The modified 17-mers were 

X - 

I 

a: d (TAAAACGACGGCCAGUG) (molecular mass: S4S4) 
5 (SEQIDN0:4) 

, - X X 

I I 
b: d (UAAAACGACGGCCAGUG) (molecular mass S634) 
10 (SEQ ID N0:5) 

where X = -C»C-(CH2)i j-OH 

(umnodified 17-mer: molecular mass: S273) 

15 

The samples were prepared and 500 finol of each modified 17-mer was 
analyzed using MALDI-MS as described in Example 18. The conditions used were 
refiectron positive ion mode with an acceleration of 5 kV and postacceleration of 20 kV. 
The MALDI-TOF spectra which were generated were superimposed and are shown in 
20 KGURE 18. 

Detection of Polymerase Chain Reaetion Products Containing 7-Deazapurine 

25 

M ATFWf AI q ANn MRTHODS 
PCR amplifications 

The fdlowing oligodeoxynacleotide primers were either synthesized 
according to standard phosphoamidite chemistry (Sinha, N.D,. et al., (1983) Tetrahetton 
30 Let. Vol. 24, Pp. 5843-5846; Sinha. N.D., et al.. (1984) Nucleic Adds Res., Vol. 12, Pp. 
4539-4557) on a MilliGen 7500 DNA synthesizer (Millipore, Bedford, MA, USA) in 200 
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nmol scales or purchased from MWG-Biotech (Ebersberg, Germany, primer 3) and 
Biometra (Goettingen, Germany, primers 6-7). ■ - - 

primer 1: 5'- GTCACCCTCGACCTGCAG SEQ. ID. NO. 6); 

primer 2: 5'- TTGTAAAACGACG6CCAGT (SEQ. ID. NO. 7); 

primer 3: 5'- CTTCCACCGCGATGTTGA (SEQ. ID. NO. 8); 
'pnmer4: 5'- CAGGAAACAGGTATGAC (SEQ. ID. NO. 9); 
.primer 5: 5'- gtaaaacgagggccagt (SEQ. ID. NO. 10); 

primer 6: , 5 • - GTCACCCTCGACCTGCAgC (g: RiboG) (SEQ. ID. NO. 1 1); 
, primer 7: 5»- GTTGTAAAACGAGGGCCAgT (g: RiboG) (SEQ. ID. NO. 12); 

The 99-mer and 200-mer DNA strands (modified and unmodified) as well 

as the ribo- and 7-dea2a-niodified 100-mer were amplified firom pRFcl DNA (10 ng, 
generously supplied S. Feyerabend. University of Hamburg) in 100 nL reaction volume 
containing 10 mmol/L KCI, lOinmol/L (NH4)2S04, 20 nunol/L Tris HCI (pH = 8.8), 2 
mmol/L MgS04. i^^o(-)Pseudococcus fiiriosus (Pfu) -Buffer, Pharmacia, Freibuig, 
(jermany). 0.2 mmol/L each dNTP (Pharrnada, Frd9>urg, Germany), 1 |tmol/L of each 
primer and 1 unit of exo(-)^ DNA polymerase (Stratagene, Heidelberg. Germany). 

For the 99-mer primers 1 and 2, for the 200-mer pnmxs I and 3 and for 
the 100-mer primers 6 and 7 were used. To obtain 7-deazapurine modified nucleic adds, 
during PCR-amplification dATP and dGTP were replaced with 7-deaza-dATP and 7- 
deaza-dGTP. The reaction was peiformed in a thermal cycler (QmniGene, iVfWG- 
Biotech. Ebersbeig. Germany) using the c^e: doiaturation at 95'C for 1 mih., annealing 
atSl'Cfbrl min. and extension at 72'C for 1 min. For all PGRs the number of reaction 
cydes was 30. The reaction was allowed to extwjd for additional 10 min. at 72 °C after 
the last c^e. 

The 103-mer DNA strands (modified and unmodified) were amplified 
fit)m M13mpl8 RFI DNA (100 ng, Pharmacia. Freiburg. Gennany) in 100 nL reaction 
volume using primers 4 and 5 all other concentrations were unchanged. The reaction was 
performed using the cyde: denaturation at 95'*C for 1 min., annealing at 40'C for 1 min. 
and ext«ision at 72 "G for 1 min. After 30 cydes for the unmodified and 40 cydes for 
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the modified 103-iner respectively, the samples were incubated for additional 10 min. at 
72^C - - 

Synthesis of 5'-^^ 'PJ-labeledPCR-primers 

32 

Primiers 1 and 4 were S'-[ -P]-labeied employing T4-polyniicIeotidkinase 
(Epicentre Technologies) and (y-^^P)-ATP. (BLU/NGG/502A, Dupont, Germany) 
according to the protocols of the manufacturer. The reactions were performed 
substituting 10% of primer 1 and 4 in PCR with the labeled primers under otherwise 
unchanged reaction-conditions. The amplified DNAs were sq)arated by gel 
electrophoresis on a 10% polyacrylamide gel. The appropriate bands were excised and 
counted on a Packard TIU-CARB 460C liquid scintillation system (Packard, CT, USA). 

Primer^leqmgeJnmribo-mo<BfiedPCR^pro^ 

The ampUfied DNA was purified using intrafi'ee-MC filt^ units (30,^^^ 
NMWLX it was then redissoived in 100 ^l of 0.2 mol/L NaOH and heated at QS'^C for 25 
minutes. The solution was then acidified with HCl (1 mol/L) and fiirther purified for 
MALDI-TOF analysis employing Ultrafi-ee-MC filter units (10,000 NNTWL) as described 
below. 

Purification of PCR products 

All samples were purified and concentrated using Ultrafi'ee-MC units 
30000 NMWL (Millipore, Eschbom, Gennany) according to the manufacturer's 
description. After lyophilisation, PCR products were redissolved in S tiL (3 ^L for the 
200-mer) of ultrapure water. This analyte solution was directly used for MALDI-TOF 
measurements. 

MALDI-TOFMS 

Aliquots of 0.5 of analyte sohition and 0.5 of matrix solution (0.7 
mol/L 3-HPA and 0.07 mol/L ammonium citrate in acetonitrile/water (1:1, v/v)) were 
mixed on a flat metallic sample support. After drying at ambient temperature the sample 
was introduced into the mass spectrometer for analysis. The MALDI-TOF mass 
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spectrometer used was a Fimugan MAT Vision 2000 (Finnigan MAT, Bremen, 
Germany). Spectra were recorded in the positive ion reflector mode witlwi 5 keV ton 
source and 20 keV postacceleration. The instrument was equipped with a nitrogen laser 
(337 nm wavelength). The vacuum of the system was 3-4* lO"* hPa in the analyzer 
r^on and 1-4* 10 hPa in the source r^on. Spectra of modified and unmodified DNA 
samples were obtained with the same relative laser power; external calibration was 
-"Permed with a mixture of synth^c oligodeoxynudeotides (7-to50-mer). 

RESULTS AND DISCUSSION 

Emymadc synthesis ofT-dcazapurine nudeotide containing nucleic 

acids by PCR 

In ordo- to demonstrate the feasibility of MALDI-TOF MS for the rapid, 
gd-fi-ee anafysis of short PCR products and to investigate the eflFect of 7-dea2apurine 
modification of nudeic adds under MALDI-TOF conditions, two different primer- 
template systems were used to syntheaze DNA fi-agments. Sequences are displayed in 
Figures 24 and 25. While the two single strands of the 103-mer PCR product had nearly 
equal masses (Am= 8 u). the two single strands of the 99-mer differed by 526 u. 

Considering the facts that 7-dea2a purine nudeotide buil(Ung blocks for 
chemical DNA synthesis are approximately 160 times more expensive than regular ones 
(Product Information, Glen Research Corporation, SterUng, VA) and thdr application in 
standard p-cyano-phosphoaraidite chemistry is not trivial (Product Inforrmoioni Glen 
Research Corporation, Steriing, VA; Schndder , K and B.T. Chait (1995) Nucleic Acids 
Res.23, 1570) the cost of 7-deaza purine modified primers wouM be very high. 
Therefore, to increase the appIicabiliQr and scope of the method, all PCRs were 
performed using unmodified oligonudeotide primus which are routinely available. 
Substituting dATP and dGTP by c'hIATP and c'rdGTP in polymerase chain reaction led 
to products containing approximately 80% 7-deaza-purine modified nudeosldes for the 
99-mer and 103-mer; and about 90% for the 200-mer, respectively. Table II shows the 
base compoation of all PCR products. 
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TABLED 

Base composition of the 99-nier, 103-nierand 200>iner PGR amplificatioii products 
(unmodified and 7-^deaza purine modified) 



25 
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"s" and "a" describe "sense" and "antisense" strands of the double-stranded PGR 

product. 
2 

indicates relative modification as percentage of 7-dea2:a purine modified nucleotides of 
total amount of purine nucleotides. 



30 However, it remained to be determmed whether 80-90% 7-dcaza-purine 

modification would be sufficient for accurate mass spectrometer detection. It was 
therefore important to detemiine whether all purine nucleotides could be substituted 
during the enzymatic amplification step, it was found that txoi-^yPseudqcoccusfuriosns 
{Pfii) DNA polymerase indeed could accept c^-dAtP and c^-dGTP in the absence of 

3S unmodified purine triphosphates. However, the incorporation was less efficient leading 
to a lower yield of PGR product (Figure 26). EthidiunKbromidc stains by intercalation 
with the stacked bases of the DN A-douUestrand. Therefore lower band intensities in the 
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etWdium-bromide stained gel might be artifacts since the modified DNA-strands do not 
necessarily need to give the same band intensities as the unmodified ones- 

To verify these results, the PCRs with [^^]-labeled primers were 
repeated. The autoradiogram (Figure 27) cleariy shows lower yields for the modified 
PCR-products. The bands were exdsed from the gel and counted. For all PCR.products 
-the yield of the modified micldc acids was about 50%, referring to the corresponding 
unmodified amplification product. Further expoiments showed that eKo{-)PeepVmt and 
Vent DNA polymerase were able to incorporate c^-dATP and c^-dGTP during PGR as 
wen. The overall peiformance, however, turned out to be best for the exoi-)Pfit DNA 
polymerase giving least side products during amplification. Using all three polymerases, 
it was found that such PCRs employing c^-dATP and c^-dGTP instead of their isosteres 
shovved less ade-reactions giving a cleaner PCR-product. Decreased occurrence of 
amplification side products may be explained by a reduction of primer mismatches due to 
a lower stability of the complec formed fi-om the primer and the 7-deaza-purine 
containing template which is synthesized during PCR. Decreased mehing point for DNA 
duplexes containing 7-dea2a-purine have been described (Mizusawa, S. et al., (1986) 
Nucleic Acids Res.M, 13 19-1324). In addition to the three polymerases specified above 
(exo(-) Deep Vent DNA polymerase. Vent DNA polymerase and exo(-) {Pfu) DNA 
polymerase), it is anticipated that other polymerases, such as the Urge Klenow fragment 
of E. coli DNA polymerase, Sequenase, Taq DNA polymerase, and U AmpIiTaq, 
AmpUTaq or AmpIiTaq TS DNA polymerase can be used- In addition, where RNA is the 
template, RNA potymerases. such as the SP6 or the T7 RNA polymerase, must be used 

MALDJ-TOF mass spectrometry of modified and mmoOfiedPCR 
produas. 

The 99-mer, 103-mer and 200-mer PCR products were analyzed by 
MALDI-TOF MS. Based on past experience, it was knowm that the degree of 
depurination depends on the laser energy used for desorption and ionization of the 
analyte. Since the influence of 7-deazapurine modification on fi^gmentation due to 
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depurination was to be investigated, all spectra were measured at the same relative laser 
energy. _ 

Figures 28a and 28b show the mass spectra of the modified and 
unmodified 103-mer nucleic acids. In case of the modified 103-mer, fi-agmentation 
5 causes a broad (M4^H) signal. The maximum of the peak is shifted to lower masses so 
that the assigned mass represents a mean vahie of (M-4i) signal and signals of 
fragmented ions, rather than the (M+IQ^ signal itself Although the modified 103-mer 
still contains about 20% A and G firom the oligonucleotide primers, it shows less 
fi^gnientation which is featured by much more narrow and symmetric signals. Espedally 

10 peak tailing on the lower niass side due to depurination, is substa^^ Hence, 
the difference between measured and calculated niass is strongly reduced although it is 
still below the expected mass. For the unmbdified sample a (M+H)^ signal of 3 1670 was 
observed, which is a 97 u or 0.3% difference to the calculated mass. While, in case of the 
modified sample this mass dtflference dinunidied to 10 u or 0.03% (31713 u found, 31723 

IS u calculated). These observaticms aie verified by a significant increase in mass resolution 
of the (M+H) signal of the two signal strands (m/Am - 61 as opposed to 1 8 for the 
unmodified sample with Am = fiill width at half maximum, fwhm). Because of the low 
mass difiference between the two single strands (8 u) their individual signals were not 
resoKred. 

20 With the results of the 99 base pair DN A fi^gments the effects of 

increased mass resolution for 7*deazapurine containing DN A becomes even more 
evident. The two single strands in the unmodified sample were not resolved even though 
the mass difference between the two strands of the PCR product was very high with 526 
u due to unequal distribution of purines and pyrimidines (figure 29a). In contrast to this« 

25 the modified DNA showed distinct peaks for the two single strands (figure 29b) which 
makes the superiority of this approach for the determination of molecular weights to gel 
electrophoretic methods even more profound. Ahhdugh base line resolution was not 
obtained the individual masses were abled to be assigned with an accuracy of 0.1%: Am 
- 27 ii for the lighter (calc. niass 30224 u) and Am = 14 u for the heavier strand (calc. 

30 mass » 30750 u). Again, it was found that the fiiU width at half maximum was 
substantially decreased for the 7*deazapurine containing sample. 
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In case of both the 99-nier and 103-iner the 7-deazapurine containing 
nucleic adds seem to give higher sensitivity despite the fact that they stilLcontain about 
20% unmodified purine nucleotides, To get comparable signaJ-to-noise ratio at similar 
intensities for the (M+H) signals, the unmodified 99-mer required 20 laser shots in 
contrast to 12 for the modified one and the 103-mer required 12 shots for the unmodified 
sample as opposed to three for the T-deazapurine nucleoside-containing PCR product. 
' T Comparing the spectra of the modified and unmodified 200-mer 

amplicons, improved mass resolution was ag^n found for the 7-deazapurine containing 
sample as well as increased signal intensities (figures 30a and SQb). While the signal of 
the single strands predominates in the spectrum of the modified sample the DNA-suplex 
and dimers of the single strands gave the strongest signal for the unmodified sample. 

A complete T^deaza purine modification of nucleic acids may be achieved 
either using modified primers in PCR or cleaving the unmodified primers from the 
partially modified PCR product. Since disadvantages are associated with modified 
primers, as described above, a 1 00-mer was synthesized using primers with a ribo- 
modification. The primers were cleaved hydrolytically with NaOH according to a method 
developed earlier in our laboratory (Koester, H. et al., Z Physiol, Chem. 359: 1 570- 
1589). Figures 31a and 31b display the spectra of the PCR product before and after 
primer cleavage. Figure 3 lb shows that the hydrolysis was successfiil: Both hydrolyzed 
PCR product as well as the two released primers could be detected together with a small 
signal fi'om residual uncleaved lOO-mer^ This procedure is especially usefiil ibr the 
MALDl-TOF analysis of very short PCR-products since the share of unmodified purines 
originating fit>m the primer increases with decreasing length of the amplified sequence. 

The remarkable properties of 7-deazapurine modified nucleic acids can be 
explained by either more effective desorption and/or ionization, increased ion stability 
and/or a lower denaturation energy of the double stranded purine modified nucleic acid. 
The exchange of the N*7 fi^r a methine group results in the loss of one acceptor for a 
hydrogen bond which influences the ability of the nucleic acid to form secondary 
structures due to non-Watson-Crick base pairing (Seela, F. and A. Kehne (1987) 
Biochemistry, 26, 2232-2238.), which should be a reason for better desorption during the 
MALDI process. In addition to this the aromatic system of 7-deazapurine has a lower 
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electron density that weakens Watson-Crick base pairing resulting in a decreased mehing 
point (Mizusawa, S. et al., i\9B6) Nucleic Acids Res., 14, 1319-1324) of the double- 
strand. This effect niay decrease the ehergy needed for denaturation of the duplex in the 
MALDI process. These aspects as well as the loss of a site which probably will carry a 
positive charge on the N-7 nitrogen renders the 7-dea:^purine modified nucleic acid less 
polar and niay prohK>te the effie^vehess of Resorption. 

Because of the absence of N-7 as proton acceptor and the decreased 
polarizaiton of the C-N bond in 7-deazapurine nucleosides depurination following the 
mechanisms established for hydrolysis in solution is prevented. Although a direct 
correlation of reactions in sohitton and in the gas phase is problematic, less fragmentation 
due to depiurination of the modified nucleic adds can be expected in the MALDI process. 
Depurtitation may either be accompanied by loss of charge which decreases the total yield 
of charged species or it iiiay produce charged fiagmentation products which decreases 
the intensity of the hbri firagmenied molecular ion sigriai. 

The observation of both increased sen^tivity and decreased peak tailing of 
the (M+H) signals on the lower mass side due to decreased fragmentation of the 7- 
dearapurine containing samples indicate that the N-7 atom indeed is essential for the 
mechanism of depurination in the MALDl-TOF process In conclusion, 7-dea2apurine 
containing nucleic acids show distinctly increased ion-stability and sensitivity under 
MALDI-TOF conditions and therefore provide for higher mass accuracy and mass 
resolution. 

EXAMPLE 2? 
Solid State Sequencing and Mass 5|>ectr6meter Detection 

MATCM ALS AMP M35THQPS 

Oligonucleotides were purchased firom Operon Technologies (Alameda, 
C A) in an unpurified fonh. Their sequences are listed in Table III. Sequencing reactions 
were peiformed on a solid surface using res^gents firom the sequencing kit for Sequenase 
Version 2.0 (Amersham, Arlington Heights, Illinois). 
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S^gtiertcing q fp-m^r torgft - - 

Sequendng complex: 
5 • -TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA- (A**) ^-3 • 
(DNAl 1683) (SEQ. ID. No. 13) 

3 '^TCAACACTGCATGT-S • (PNA16/DNA) (SEQ. ID, No. 14) 

In order to perfonn solid-state DNA sequencing, template strand 
DNAl 1683 was 3'-biotinylated by terminal deoxynudeoddyl transferase. A 30 pi 
reaction, containing 60 pmol of DNAl 1683^ 1.3 nmolof btotin 14-dATP (GIBCO BRL, 
Grand Wand, NY), 30 units of terminal transferase ( Amersham. Arlington Heights, 
Illinois), and Ix reaction buffer (supplied with enzyme), was incubated at 37**C for 1 
hour. The reaction was stopped by heat inactivation of the terminal transferase at 70°C 
for 10 min. The resuhing product was desalted by passing through a TE-10 spin column 
(Clonetech). More than one molecules of biotin- 14-dATP could be added to the 3 -end 
of DNAl 1 683. The biotinylated DNAl 1683 was incubated with 0.3 mg of Dynal 
streptavidin beads in 30 ^l Ix binding and washing buffer at ambient temperature for 30 
min. The beads were washed twice with TE and redissolved in 30 pi TE, 10 pi aliquot 
(containing 0. 1 mg of beads) was used for sequ^cing reactions. 

The 0. 1 mg beads from previous step were resuspended in a I Opl volume 
containing 2 pi of 5x Sequenase buffer (200 mM Tris-HCI. pH 7.5, 100 mM MgCI2, and 
250 mM NaCl) firom the Sequenase kit and 5 pmol of corresponding primer 
PNA16/DNA. The annealing mixture was heated to TO'^C and allowed to cool slowly to 
room temperature over a 20-30 min time period. Thwi 1 pi 0. 1 M dithiothf^tol solution, 
1 pi Mn buffer (0.15 M sodium isodtrate and 0.1 M McC12), and 2 pi of diluted 
Sequenase (3.25 units) were added. The reaction mixture was divided into four aliquots 
of 3 pi each and mixed with termination mixes (each consists of 3 pi of the appropriate 
termination mbc: 32 pM c7dATP, 32 pM dCTP, 32 pM c7dGTP, 32 pM dTTP and 3.2 
pM of one of the four ddTNPs, in 50 mM NaCI). The reaction mixtures were incubated 
at 37'C for 2 min. After the completion of extension, the beads were precipitated and 
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the supernatant was removed. The beads were washed twice and resuspended in T£ and 
kept at 4X. _ - 

S Sequencing complex: 

5 ' -AAGATCTGACCAGGGATTCGGTTAGCGTGACTGCTGCTGCTGCTGCTGCTGC 
TGGATGATCCGACGCATCAGATCTGG- (A^) (SEQ. ID. NO. 15) (TNR.PLASM2) 
3 • -CTACTAGGCTGCGTAGTC-5 ' (CMl ) (SEQ. ID. NO. 16) 

The target TNR.PLASM2 was biotinylated and sequenced u^g 
10 procedures stmSar to those desoibed in previous section (sequencing a 39-mer target). 

Setfuencih^ a JS^er target with partiaUv duplex probe 

Sequehdng complex: 

5 3» (SEQ. ID. No. 17) 

•-F-GATGATCCGACGCATCACAGCTC 

3' 3' (SEQ. ID. No. 18) 

15 -b-CTACTAGGCTGCGTAGTGTCGAGAACCTTGGCT ■ ■ 

CM1B3B was immobilized on Dynabeads M280 with streptavidin (Dynal, 
Norway) by incubating 60 pmol of CM1B3B with 0.3 magnetic beads in 30 IM NaCl 
and TE (Ix binding and washing buffer) at room temperature for 30 min. The beads 

20 were washed twice with TE and redissolved in 30 >il TE, 1 0 or 20 ^1 aliquot (containing 
0. 1 or 0.2 mg of beads respectively) was used for sequencing reactions. 

The duplex was formed by annealing corresponding aliquot of beads from 
previous step with 10 pmol of DFl laSF (or 20 pmol of DFl la5F for 0:2 mg of beads) in 
a 9 |il volume containing 2 nlof 5x Scqumase buffer (200 mM Tris-HCI, pH 7.5, 100 

25 mM MgClU and 250 mM NaCI) from the Sequenasie kit. The annealing mixture was 
heated to 65*'C and allowed to coolslowly to 37''C over a 20-30 nun dnie period. The 
duplex primer was then mixed with 10 ptnol of TSIo (20 pmol of TSIO for 0.2 mg of 
beads) in 1 pi volume, and the resulting mixture was further mcubated at 37'*C for 5 min, 
room temperature for 5*10 min. Then 1 pi 01 M dithiothreitol solution, I pi Mn buffer 

30 (0. 1 5 M sodium isocitrate and 0. 1 M MnClj^X and 2 pi of diluted Sequenase (3 .25 units) 
were added. The reaction mixture was divided into four aliquots of 3 pi each and mixed 
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with tennination mixes (each consists of 4 pi of the appropriate termination mix: 16 
dATP. 16 \iM dCTP, 16 pM dGTP, 16 pM dTTP and 1.6 \iM of one^of4he four 
ddNTPs, in SO mM NaCl). The reaction mixtures were incubated at room temperature 
for S min, and 3VC for 5 min. After the completion of extension, the beads were 
S precipitated and the supernatant was removed. The beads were resuspended in 20 \il TE 
and kept at A^'C. An aliquot of 2 pi (out of 20 from each tube was taken and mixed 
* ' with 8 pi of formamide» the resulting samples were denatured at 90-9S*'C for 5 min and 2 
pi (out of 10 pi total) was applied to an ALF DNA sequencer (Pharmacia, Piscataway, 
NJ) using a 10% polyacrylamide gd containing 7 M urea and 0.6x TBE. The remaining 
10 aliquot was used for MALDI-TOFMSanaIy»s. 

Before MALDI analysis, the sequencing ladder loaded magnetic beads 
were washed twice using 50 mM ammonium citrate and resuspended in 0.5 pi pure 

15 water. The suspension was then loaded onto the sample target of the mass spectrometer 
and 0.5 pi of saturated matrix solution (3-hydropicolinic acid (HP A): ammonium citrate 
= 10: 1 mole ratio in 50% acetonitrile) was added. The mixture was allowed to dry prior 
to mass spectometer analysis. 

Hie reflectron TOFMS mass spectrometer (Vision 2000, Finnigan MAT, 

20 Bremen, Germany) was used for analysis. 5 kV was aiq)lied in the ion source and 20 kV 
was applied for postaccderation. All spectra were taken in the positive ion mode and a 
nitrogen laser was used. Normally, each spectrum was averaged for more than 100 shots 
and a standard 2S-point smoothing was applied. 

25 RESULTS ANP DISCUSSIONS 

In conventional sequencing methods, a primer is directly annealed to the 
template and then extended and terminated in a Sanger dideoxy sequencing. Normally, a 
biotinylated primer is used and the sequencing ladders are captured by streptavidin- 
30 coated magnetic beads. After washing, the products are eluted from the beads using 

EDTA and formamide. However, our previous findings indicated that only the annealed 
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Strand of a duplex is desorbed and th immobilized strand remains on the beads (Tang, K. 
et al., (1995) Nucleic Adds Research 25:3126-3131). Therefore, it is advantageous to 
immolnlize the template and anneal the primer. After the sequencing reaction and 
washing, the beads with the immobilized template and annealed sequendng ladder can be 
loaded directly onto die mass spearometer taxgi^ and mix with matrix. In MALDl, only 
the annealed sequencing ladder will be desorbed and ionized, and the inmiobilized 
. template will remain on the target. 

A 39-mer template (SEQ. ED. No. 13) was first biotinylated at the 3' end 
by adding biotin-14>dATP with tenninal transferase. More than one biotin-I4-dATP 
molecule could be added by the enzyme. However, since the template was immobilized 
and remained on the beads during MALDI, the numb^ of biotin-14-dATP would not 
affect the mass spectra. A 14-mer primer (SEQ. ED. No. 14) was used for the solid-state 
sequencing. MALDI-TOF mass spectra of the four sequencing ladders are shown in 
Figure 32, and the expected theoretical values are shown in Table III. The sequendng 
reaction produced a relatively homogenous ladder, and the iull-length sequence was 
determined easily. One peak around 5 1 SO appeared in all reactions are not identified. A 
possible &cplanation is that a small portion of the tmplate formed some kind of 
secondary structure, such as a loop, which hindered sequenase extension. Nfis- 
incorporation is of minor importance, since the intensity of these peaks were much lower 
than that of the sequencing ladders. Although 7-deaza purines were used in the 
sequencing reaction, which could stabilize the N-glycosidic bond and prevent 
depurination, minor base losses were still observed since the primer was not substituted 
by 7-deazapurines. The fiiU length ladder, with a ddA at the Y end, appeared in the A 
reaction with an apparent mass of 1 1899.8. However, a more intense peak of 122 
appeared in all four reactions and is likely due to an addition of an extra nucleotide by the 
Sequenase enzyme. 
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The same technique could be used to sequence longer DNA fragments. A 
78-nier tonpiale containing a CTG repeat (SEQ. ID. No. IS) was3'-biotinylated by 
adding biotin-14-dATP with terminal transferase. An 18-mer primer (SEQ. ID. No. 16) 
was annealed right outside the CTG repeat so that the repeat could be sequenced 
immediately after primer extension. The four reactions were washed and analyzed by 
MALDI-TOFMS as usual. An example of the G-reaction is shown in Figure 33 and the 
^ expected sequencing ladder is shown in Table IV with theoretical mass values for each 
ladder component. All sequencing peaks were well resolved except the last component 
(theoretical value 20577.4) was indistmguishable from the background. Two ndghboring 
sequencing peaks (a 62-iiier and a 63-mer) were also separated indicating that such 
sequencing analysis could be applicable to longer templates. Again, an addition of an 
extra nucleotide by the Sequenase enzyme was observed in this spectrum. This addition 
is not template specific and appeared in all four reactions which makes it easy to be 
identified. Cbmpared to the primer peak, the sequencing peaks were at much lower 
intensity in the long template case. Further optimization of the sequencing reaction may 
be required. 
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TABLE XV (Contxnxied) 



ddKT ddCTP ddSTP ddTTF 

1. 5491.6 5491.6 5491.6 5491*6 - ~ 

2. 5764.8 
5 3. 6078.0 

4. 6407.2 

5. 6696.4 

6. 7009,6 

7. 7338.8 
10 8. 7628.0 

9. 7941.2 

10. — - 8270.4 

11. 8559.6 

12. 8872.8 

15 13. 9202.0 

14. 9491.2 

15. 9804.4 

16. 10133.6 

17. 10422.8 
20 18. 10736.0 

19. 11065.2 

20. 11354.4 

21. 11667.6 . 

22. 1196^8 
25 23. 12286.0 

24. 12599.2 

25. 12928.4 

26. 13232.6 

27. 13521.8 
30 28. 13835.0 

29. 14124.2 

30. 14453.4 

31. 14742.6 

32. 15046.8 

35 33. 15360.0 

34. 15673.2 

35. 15962.4 

36. 16251.6 

37. 16580.8 
40 38. 16894.0 

39. 17207.2 

40. . 17511.4 

41. 17800.6 

42. 18089.8 
45 43. 18379.0 

44. 18683.2 

45. 19012.4 

46. 19341.6 

47. 19645.8 
50 48. 19935.0 

49. 20248.2 

50. 20577.4 

51. 20890.6 

52. 21194.8 
55 53. 21484.0 

54. 21788.2 

55. 22092.4 



60 



wo 97/37041 PCT/US97/04394 

- 77 - 

Seqtiertcing using duplex DNA probes for capturing and priming 
Duplex DNA probes with single-stranded overhang have been demonstrated to 
be able to capture specific DNA templates and also serve as primers for solid-estate sequencing. 
The scheme is shown in Figure 34. Stacking interactions between a duplex^probe and a single- 
5 stranded template allow only S-base overhand to be sufficient for capturing. Based on this 
format, a 5* fluorescent-labeled 23-mer (5'-GAT GAT CCG ACG CAT CAC AGC TC) (SEQ. 
ID. No. 19) was annealed to a 3'-biotinylated 18-mer (5'-GTG ATG CGT CGG ATC ATC) 
(SEQ. ID. No. 20), leaving a 5-base overhang. A 15-mer template (5'-TCG GTT CCA AGA 
GdOTSEQ ID. No. 21) was captured by the duplex and sequencing reactions were performed 

10 by extension of the S-base overhang. MALDI-TOF mass spectra of the reactions are shown in 
Figure 35. All sequencing peaks were resolved although at relatively low intensities. The last 
peak in each reaction is due to unspecific addition of one nucleotide to the fiiU length extension 
product by the Sequenase enzyme. For comparison, the same products were run on a 
conventional DNA sequencer and a stacking fluorogram of the results is shown in Figure 36. 

15 As can be seen from the Figure, the mass spectra had the same pattern as the fluorogram with 
sequencing peaks at much lower intensity compared to the 23-mer primer. 

Improvements of MALDI-TOF mass spectrometry as a detection technique 
Sample distribution can be made more homogenous and signal intensity could 

20 potentially be increased by implementing the picoliter vial technique. In practice, the samples 
can be loaded on small pits with square openings of 100 um size. The beads used in the solid- 
state sequencing is less than 10 um in diameter, so they should fit well in the microliter vials 
Kficrocrystals of matrix and DNA containing "sweet spots" will be confined in the vial. Since 
the laser spot size is about 100 fim in diameter, it will cover the entire opening of the vial. 

25 Therefore, searching for sweet spots will be unnecessary and high repetition-rate laser (e.g. 
>10Hz) can be used for acquiring spectra. An earlier report has shown that this device is 
ciq>able of increasing the detection sensitivity of peptides and proteins by several orders of 
magnitude compared to conventional MALDI sample preparation technique. 

Resolution of MALDI on DNA needs to be further improved in order to extend the 

30 sequencing range beyond 100 bases. Currently, using 3-HP A/ammonium citrate as matrix and a 
reflectron TOF mass spectrometer with 5kV ion source and 20 kV postacceleration, the 
resolution of the run-through peak in Figure 33 (73-mer) is greater than 200 (FWHM) which is 
enough for sequence determination in this case. This resolution is also the highest reported for 
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MALDI desorbed DNA ions above the 70-mer range. Use of the delayed extraction technique 
may further enhance resolution. 

: All of the above-cited references and publications are hereby incorporated by 

reference. 

5 EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, numerous equivalents to the specific procedures described heran. 
Such ixfuivalents are consider^ be within the scope of this invention and are covered by the 
following daims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Koster, Hubert 
Ui) TITLE OF INVENTION: DNA SEQUENCING BY MASS SPECTROMETRY 
(ill) NUMBER OF SEQUENCES: 21 

(Iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Patent Group 

Foley, Hoag & Eliot LLP 

(B) STREET: One Post Office Square 
{C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02109-2170 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

iC) OPERATING SYSTEM: PC-DOS/MS-DOS 
(D) SOFTWARE: ASCII (text) 

<vi) CURRENT APPLICATION DATA: 
<A) APPLICATION NUMBER: 

(B) FILING DATE: 18-MAR-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/617,010 
(B> FILING DATE: 18-MAR-1996 

(Viii) PRIOR APPLICATION DATA: 

tA) APPLICATION NUMBER: 08/178,216 

(B) FILING DATE: 06-JAN-1994 

(C) CLASSIFICATION: 

(ix) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Arnold, Beth E. 

(B) REGISTRATION NUMBER: 35,430 

(C) REFERENCE/DOCKET NUMBER: SQA-3.25.27 
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(X) TELECOMMUNICATION INFOmATION: 

(A) TELEPHONE: (617) 832-1294 

(B) TELEFAX: (€17) 832-7000 

5 

(2) INFORMATION FOR SEQ ID N0:1: 



(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 base pairs 
10 ' (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(O) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



20 



CATGCCATGG CATG 

(2) INFORMATION FOR SEQ ID NO: 2: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
AAATTGTGCA CATCCTGCAG C 

40 

(2) INFORMATION FOR SEQ ID NO: 3: 



35 



<i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
TAAC6GTCAT TACGGCCATT GACTGTAGGA CCTGCATTAC ATGACTAGCT 50 

15 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: YES 

30 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TAAAACGACG GGCCAGXG 17 

35 

(2) INFORMATION FOR SEQ ID NO: 5 

(ij SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 



(iii) HYPOTHETICAL: YES 



5 

(xi) SEQUENCE DESCRIPTION: 5E0 ID NO: 5: 



XAAAACGACG GGCOVGXG 

10 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

25 

GTCACCCTCG ACCTGCAG 

(2> INFORMATION FOR SEQ ID N0:7: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

<ii) MOLECULE TYPE: cDNA 



40 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



TTGTAAAACG ACGGCCAGT 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ISCTTCCACCGC GATGTTGA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
20 CA) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

30 

CAGGAAACAG CTATGAC 

(2) INFORMATION FOR SEQ ID NO: 10: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: cDNA 
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(xl) SEQUENCE DESCRIPTION: 5EQ ID NO:I0: 
GTAAAACGAC GGCCAGT 17 
5 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHAJRACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
10 - ' " (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(ix) FEATURE: 

(A) NAME/KEY: misc_£eature 

(B) LOCATION: 1,.19 

(D) OTHER INFORMATION: /note= "All lowercase letters 
20 represent RiboG** 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

25 GTCACCCTCG ACCTGCAgC 19 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
30 {A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc^feature 
40 (B) LOCATION: 1..20 

{D) OTHER INFORMATION: /note= "All lowercase letters 
represent RiboG" 
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(xi) SEQUENCE DESCRXE'TION : SEQ ID NO: 12 
GTT6TAAAAC GAGGGCCAgT 
S (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
10 — (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TCTGGCCTGG TGCAGGGCCT ATTGTAGTTG TGACGTACA 
20 (2) INFORMATION FOR SEQ IQ NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

lii> MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 
TCAACACTGC ATGT 
35 (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 15: 
5 AAGATCTGAC CAGGGATTCG GTTAGCGTGA CTGCTGCTGC TGCTGCTGCT GCTGGATGAT 60 
CCGAC6CATC AGATCTGG 78 
(2) INFOPMATION FOR SEQ ID NO: 16: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTACTAGGCT GCGTAGTC 18 
25 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

35 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GATGATCCGA CGCATCACAG CTC 23 

40 (2) INFORMATION FOR SEQ ID NO: IS: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

10 CTACTAJSGCT GCGTAGTGTC GAGAACCTTG GCT 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

25 

GATGATCCGA CGCATCACAG CTC 

(2) INFORMATION FOR SEQ ID NO: 20: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: cDNA 

y 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

40 

GTGATGCGTC GGATCATC 



{2) INFORMATION FOR SEQ ID NO: 21: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 

5 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDMA 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 
TCGGTTCCAA GAGCT 15 
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CLAIMS 

1 . A method for determining the sequence of a nudetc acid, comprising the 
steps of: 

a) generating at least two base-specifically terminated nucleic add 
fragments containing modified purine nucleotides that are relatively 
resistant to firagmentation during mass spectrometry; 

b) determining the molecular wdght value of each base-specifically 
tenninated fingment by mass spectrometry, wh^ein the molecular weight 
valu^ of at least two base-spedfically terminated fi-agments are 
determined concurrently; and 

c) determining the sequence of the nucleic acid by aligning the base-specifically 
terminated nucleic acid fragments according to molecular vyeight. 

2. The method according to claim I wherein the nucleic acid fragments are 
purified before the step of determining the molecular weight values by mass 
spectrometry. 

3. The method aqcordtng to claim 2 wherein the nucleic add fitigments are 
purified, comprising the steps of: 

a) reversib^ immobilizing the nudeic acid Augments on a solid 
support; and 

b) wasiiing out all remaining reactants and by-products. 

4. The method according to claim 3, further comprising the step of removing the 
nucleic acid fragments from the solid support. 
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5. The method of claim 1» wherein the fragments contain deazapurine moieties. 

6. The method of claim 1 , wherein the deaza purine moieties are selected from 
S the group consisting of: C^-deazaadenine, C^-deazaguanine, 7-deazainosine 

triphosphate, C'*deazaadenine, C'-deazaguanine and C'-deazatnosine 
triphosphate. 

7. The method of claim 1; wherein at least about 50% of the purine nucleotides 
10 are modified within the nucleotide fragnient. 

8. A process of claim i wherein the mass spearometer is selected from the 
group consisting of: Matrix^Assisted Laser Desorption/Ionization Time-of-Flight 
(MALDI-TOFX Electrospray (ES), Ion Cyclotron Resonance (ICRXand Fourier 

1 S Transform and combinations thereof 

9. The method according to claim 1, i^erdn more than one species of nucldc 
acid are concurrently sequenced by multiplex mass spectrometric nucleic acid 
sequencing employing nucleic acid primers, chain-elongating nucleotides, and 

20 chain-terminating nucleotides, wherein one of the sets of base-specifically 

terminated fragments is unmodified and the other sets of base-specifically 
terminated nucleic acid fragments are mass modified, and each of the sets of 
base-specifically terminated nucleic acid fi^gments has a sufficient mass 
difference to be distinguished from the others by mass spectrometry. 



25 



10. The method accorcfing to claun 9, wherein at least one of the sets of mass- 
modified base-spedfically terminated fragments is niodified with a mass- 
modifying functionality at a heterocyclic base of at least one nucleotide. 



30 



1 1 . The method according to claim 10, wherdn the heterocyclic base-modified 
nucleotide is selected from the group conasting of a cytosine nucleotide 
modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide 
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modified at the C-5 methyl group, a uracil nucleotide modified at C-5, an adenine' 
nucleotide modified at an adenine nucleotide modified at C-7. a c^- 
deazaadenine modified at a c^-deazaadenine modified at C=7, a guanine 
nucleotide modified at C-8, a guanine nucleotide modified at C-7, a c^- 
deazaguanine modified at C-8, a c^-deazaguanine modified at C-7, a 
hypoxamhine modified at C-8, a c^-deazahypoxanthine modified at C-7, and a c^* 
deazahypoxanthine modified at C*8. 

12. The method according to claim 9» wherein at least one of the sets of mass- 
modified base-spedficaOy terminated nucleic acid fiagments is modified with a 
mass*modifying functionality attached to one or more phosphate moieties of the 
intemucieotidic linkages of the figments. 

13. The method according to claim 9, wherein at least one of the sets of mass- 
modified base-specifically terminated iiucleicacid fragments is modified with a 
mass-modifying functionality attached to one or more sugar moieties of 
nucleotides within the set of mass modified baseTspecifically terminated 
fragments at at least one sugar position selected from the group consisting of a 
C-2' position, an external C- 3' position, and an external C-5' position. 

14. The method according to claim 9, wherein at least one of the sets of mass- 
modified base-specifically terminated nucleic add firagments is modified with a 
mass-modifying fimctionality (M) attached to the sugar moiety of a 5 -termmal 
nucleotide and wherein the mass-modifying function (M) is the linicing 
functionality (L). 

15. The method according to claim 9, wherein a mass-modifying fimctionality 
(M) is attached to a set of base-specifically terminated nucleic acid Augments 
subsequent to generating the base-spedficaify terminated nucleic acid firagments 
and prior to determining the molecular weight values for the nested fi^agments by 
mass spectrometry. 
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16. The method according to claim 15» wherein the base-specifically terminated 
nucleic acid fragments are generated using at least one reagent selected from the 
group consisting of a nucleic acid primer, a chain-elongating-nucleotide, a chain- 
terminating nucleotide and a tag probe which has been modified with a precursor 
of the mass*modiiying fiincuonality, M; and a subsequent step comprises 
modifying the precursor of the mass-modiiying functionality to generate the 
mass-modifying fiinctionality, M; prior to mass spectrometric analysis. 

17. Thb method according to claim 9, wherein mass differentiation of the tag 
probes is achieved by changing the nucleotide composition of at least one of the 
tag probes and complementary tag sequence in the spiecies of nucleic acid. 

18. The method according to claim 9, wherein the tag probes are covalently 
bound to the coit^|x>ndii^ a>mplementary tag sequence prior to mass 
spectronieiric analysis. 

19. The m^Hbd according to claim 18, wherein binding between the tag 

probes and the corresponding complementary tag sequences is achieved 
photbchemically via photoactivatable groups. 

20. A method of sequencing a nucleic acid, comprising the steps of 

a) reversibly linking an oligonucleotide primer to a solid support; 

b) generating at least two base-specifically terminated nucleic acid 
fitigments containing nucleotides that are relatively resistant to 
fragmentation during mass spectrometry; 

c) determining the mdlecular weight value of each nested fragment 

in each of the four sets of base-specifically terminated Augments of the nuddc 
acid by matrix assisted laser desorption/ionization mass spectrometry 
wherein the molecular wdght values of at least two base*specifically terminated 
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fragments are determined concurrently and wherein the nested fragments are 
deaved fironi the solid support by a laser during mass spectrometry; and 

d) determining the nucleotide sequence by aligning the base 
specifically terminated fragments according to molecular weight. 

21. The method according to daim 20 wherein the nudeic add fragments are 
purified before the step of determining the molecular wdght values by mass 
spectrometry. 

22. The method according to daim 21 wherein the nucldc acid fragments are 
purified, comprising the steps of: 

a) reversibly inrniobilizing the nucleic acid fragments on a solid 
support; and 

b) washing out all remaining reactants and by-products. 

23. The method according to claim 22, further comprising the step of 
removing the nucleic acid fragments from the solid support. 

24 . The method of claim 20, wherein the fragments contain deazapurine 
.moieties. 

25. The method of claim 20, wherein the deaza purine moieties are selected 

7 7 
from the group consisting of: C -deazaadenine, C -deazaguanine, 7- 

9 9 9 

deazainosine triphosphate, C -deazaadenine, C -deazaguanine and C - 

deazainosine triphosphate; 

26. The method of claim 20, wherein at least about 50% of the purine 
nucleotides are modified within the nucleotide fragment. 
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27. A process of claim 20 wherein the mass spectrometer is selected from the 
group con»5ting of Matrix* Assisted Laser Desorption/lonization Time-of-Flight 
(MALDI-TOF), Electrospray (ES), Ion Cyclotron Resonance (ICR), and Fourier 
Transfoim and combinations thereof 

5 

28. The method according to claim 20, wherein more than one species of nucleic 
acid are concurrmtly sequenced by multiplex mass 

sequendiig employing nudric add primers, chain-elongating nudeotides» and 
chain-terminating nudeotides, wherein one of the sets of base-specifically 
1 0 terminated fragments is unmodified and the other sets of base-specifically 

terminated nucleic add fi:agments are mass modified, and each of the sets of 
base-specifically terminated nuddc acid firagments has a suffident mass 
difference to be distinguished firom the others by mass 
s()ectr6metry. 

15 

29. The method according to claim 28, wherein at least one of the sets of mass- 
mbdtfied base-specifically terminated fi-agments is modified with a mass- 
modifying functionality (M) at a heterocyclic base of at least one nudeotide. 

20 30. The method according to claim 29, wherein the heterocyclic base-modified 

nucleotide is selected from the group consisting of a cytostne nucleotide 

modified at C-5, a thymine nudeotide modified at C-5, a thymine nucleotide 

modified at the C-5 methyl group» a uracil nucleotide modified at C-S, an adenine 

7 

nucleotide modified at C-8^ an adenine nucleotide modified at C-7, a c - 

7 ■ 

25 deazaadenine modified at C-8, a c -deazaadenine modified at C-7, a guanine 

nudeotide modified at C-8, a guanuie nucleotides nft>dified at C-7, a c - 

deazaguanine modified at C-8, a c -deazaguanine modified at C-7, a 

7 

hypoxanthine modified at C-8, a c ^eazahypoxanthihe modified at C-7, and a 
7 

c -deazahypoxanthine modified at C-8. 



30 



3 1 . The method according to claim 28, wherein at least ohe of the sets of mass- 
modified base-spedfically t rminated nudeic acid fiagments is modified with a 
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mass-modifying functionality (M) attached to one or more phosphate moieties of 
the intemucieotidic linkages of the fragments. 

32. The method according to claim 28, herein at least one of the sets of mass- 
modified baserspedfically terminated nucldc acid fragments is modified with a 
mass-modifying functionality (M) attached to one or more sugar moieties of 
nucleotides within the set of mass modified base-specifically terminated 
fragments at at least one sugar position selected from the group consisting of a 
C-2* position, an external C- 3* position, and an external C-5' position. 

33. The method according to claim 28, wherein at least one of the sets of mass- 
modified base-specifically terminated nucleic acid fragments is modified with a 
mass-modifying functionality (M) attached to the sugar moiety of a S -terminal 
nucleotide and wherein the mass-modifying function (M) is the linking 
functionality (L). 

34. The method according to claim 28, wherein a mass-modifying functionality 
(M) is attached to a set of base-specifically terminated nucleic acid fragments 
subsequent to generating the base-specifically terminated nucleic acid fi-agmmts 
and prior to determining the molecular weight values for the nested fragments by 
niass spectrometry. 

35. The method accordiiig to claim 34, wherdn the base-specifically 
termiiuited nucleic add fragments are generated using at least one reagent 
sdected from the group consisting of a nucleic acid primer, a chain-elongating 
nucleotide, a chain-terminating nucleotide and a tag probe which has been 
niodified with a precursor of the mass-modifying fiinctionality, M; and a 
subsequent step comprises modifying the precursor of the mass-modifying 
functionality, M, to generate the mass-modifying functionality, M, prior to mass 
spectrometric analysis. 
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36. The method according to claim 28, wherein mass difFerentiation of the tag 
probes is achieved by changing the nucleotide composition of at least one of the 
tag probes and complementary tag sequence in the species of nucleic acid. 

37 The method according to claim 28, wherein the tag probes are covalently 
bound to the corresponding complementary tag sequence prior to mass 
spectrometric analyas. 



38. The method according to claim 37, wherein binding between the tag probes 
10 and the corresponding complementaiy tag sequences is achieved 

photochemically via photoactivatable groups. 

39. A method of multiplex anal^is of nucleic acid sequences, comprising the 
steps of 

IS a) reversibly linking a nucleic acid primer to a solid support; 

b) generating at least two conditioned, base-spedfically temninated nucleic acid 
fragments containing modified purine nucleotides that are relatively resistant to 
fragmentation during mass spectrometry; 

20 c) determining the molecular weight value of each fragment by matrix assisted 

laser desorption/ionization mass spectrometry wherein the molecular weight 
values of at least two base-specifically terminated fragments are determined 
concurrently and wherein the fragments are cleaved firom the solid support by a 
laser during mass spectrometry; and 

25 

d) determining the nucleotide sequence by aligning the fragmerits according to 
molecular weight; wherdn at least one reagent sdected from a group consisting 
of, a nudeic acid primer, a diain-elongating nucleotide, and a diain-terminating 
nucleotide which has been mass-modified; wherein each set of base-spedfically 
30 terminated fragments has a su&ident mass difference from the other sets of base- 

specifically terminated fragments so as to be unique; and wherein the molecular 
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weight values of the nested fragments of two or more sets of unseparated base- 
specifically terminated fragments are determined concurrently. 

40. The method according to claim 39, wherein the reversible linkage is 
S aphotocleavable bond. 

41. The method according to claim 39 wherdn the base-spedfically tmninated 
' ^ fragments are cleaved from the soGd support prior to mass spectrometry. 
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FIGURE 2 

Figure 2A: Sanger DHA Sequeheing Reaction Products with 
ddT Termination ot KypotheticaX DKA . (50-mer> « 

5 * -dTAAC6GTC2lTTAC6GCCATT6ACT6TA66ACCT6CATTACAT6ACTAGCT 



S' 


-dT 


226.23 


5» 


-dTAACGGT 


2104.45 


5» 


-dTAACGGTCAT 


30X1«04 


S« 


-dTAACGGTCATT 


3315.24 


S" 


-dTAACGGTCATTACGGCCAT 


S771.82 


S» 


"dTAACGGTCATTACGGCCATT 


6076.02 


5* 


-dTAACGGTCATTACGGCCATTGACT 


7311. 62- 


5» 


-dTAACGGTCATTACGGCCATTGACTCT 


7945.22 


S« 


•-dTAACGGTCATTACGGCCATTGACSGTAGGACCT 


101X2.63 


5« 


-dTAACGGTCATTACGGCCATTGACTGTAGGACCTGCAT 


1X348.43 


S« 


-dTAACGGTCATTACGGCCATTGACTGTAGGACCTGCATT 


11652.62 


S» 


-dTAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACAT 


12872.42 


5» 


-dTAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACT 


14108.22 


S« 


-dTAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCX 


15344.02 



Figure 2B: Idealized KALDI-TOF Haas Spectrum Showing the 
"ion current vs. time/molecular weight". 
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Figure 3A: Sanger DMA sequencing Reaction Products .vltli 
- ddA Teraiaation e£ ftypethetieal DMA (SO-mer) 





-dTA3kCS6TCATXACG6CC3kTTGACTGTA6GACC7GCATTACATGAC7A6CT 






-dTA 


S3 0.43 


s« 








-dTAACGGTCA 


2697.83 




--dXAACGGTCATTA 


3dX9«43 


5* 


-dTAACGQTCATTACGGCCA 


5458.61 


5« 


-dTAACGGTCATTACGGCCATXGA 


6709.42 


5» 


-dTAACGGTCATTACGGCCATTGACTGTA 


8249.42 


5" 


-dTAACGGTCATTACGGCCATTGACXCTAGGA 


9221.05 


5» 


-dTAACGGTCATTACGGCCATTGACTGTAGGACCTGCA 


11035.22 


S» 


-dTAACGQTCATTACGGCOATTGACTGTAGGACCTGCATTA 


11956. B2 


5» 


-dTAACGGTCATTACGGCCATTGACTGTAGGACCXGCATTACA 


12559,21 


5> 


-dTAACGGTCATTACGGCCATXGACTGTAGGACCTCCATTACATGA 


13505.83 


S» 


-dTAACCGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTA 


14412.42 



Figure 3B: Idealized maldi-TOP Mass Spec trim showing the 
"ion current vs. time/molecular veight". 
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Figure 4 A: Sanger DNA sequencing Reaction Products- wi«i 
ddG Termination of Hypothetical DMA (50-mer). 

5 • -dTAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT 

5*-dTAACG 147 X. 05 

S«-dTAACGG 1800.25 

5**dTAACGGTCATTAC6 4246.84 

5 1 -dTAACGGTCATTACGG 4576,05 

5<~dTAACGGTCATTACGGCCATT6 6405.23 

5 • -dTAACGGTCATTACGGCCATTGACTG 7 641 . 03 

5 » -dTAACGGTCATTACGGCCATTGACTGTAG BS87 . 64 

S » -dTAACGGTCATTACGGCCATTGACTGTAGG 8916*65 

5» -dTAACGGTCATTACGGCCATTGACTGTAGGACCTG 104 41.84 

5 • -dTAACGGXCATTACGGCCATTGACTGTAGGACCTGCATTACATG 132 01 • 63 

5 • --dTAACGGTCAI^ACGGCCAtTGACTGTAGGACCTGCATTACATG ACTAG 14 7 5 0 • 64 

Figure 4B: Idealized HAX*bl-TOF Mass Spectrun Showing the 
**ion current vs. time/molecular weight". 
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Pigure 5A: Sanger DKA Sequencing Reaction Products. with 
ddC Termination of Hypothetical BHA (50-^aer) • 



5 


»-dTAACGGTCATTACGGCCATTGACTCTAGGACCTGCATCACATGACTAGCr 




5^ 


' -dTAAC 


114I.B4 




5' 


-dTAACGGTC 


2393.63 




-dTAACGGTCATTAC 




S« 


-dTAACGGTCATTACGGC 


4865.23 


5« 


«-dTAAC667CATTAC66CC 


5iS4.42 


S« 


«*dTAACGGTCAraACG6CCATTGAC 


7007.62 


5^ 


•dTAACGGTCATTACGGCCATTGACTGTAGGAC 


9519.25 


5' 


' -dTAACGGTCATTACGGCCATTGACTGTAGGACC 


9808.43 


5' 


• -dTAACGGTCATTACGGCCATTGACTGTACGACCTGC 


1073X.02 


5' 


'-dTAACGGTCATTACGGCCATTGACTGTAGGACCTqCATTAC 


12255.02 


5' 


'-dTAACGGTCATTACGCqCATTGACTGTAGGACCTGCATTACATGAC 


13804.02 


5' 


'-dTAACGGTCATTACGGCCATTGACTGTAGGACCTGCATXACATGACTAGC 


15039.82 



Figure 5B: Idealized MALDI-TOP Mass Bpectrim showing tbe 
"ion current vs. tine/aolecular veight«». 



1. 




1 3»-Sae ^®^9ht increments used: 305.20412 

210.21258 
304.1-^618 
313.20954 

il ^ 289.18454 



3 • -ddC 

12 dTp 

13 dAp 
12 dCp 



^0 lU^Sl^tides 329.20894 



FIGURE 5 



wo 97/37041 



6/43 



PCT/17597/04394 



FIGURE 6 



Figure 6: Series o£ Koleeular Weights of the BypotheticaX 
DKA (SO-^aer) after Sanger Sequencing tind 
Termination vith eitber ddTPX, dOATP, ddGTP or 
ddCTP respectively. 
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FIGURE 7 
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FIGURE 8 

Nucleoside Triphosphate Eloagators: 
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FIGURE 9 
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FIGURE 10 
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FIGURE 11 
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FIGURE 12 



e.g. for dT-Tcrminations 
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»g.f r dT-Terminations 



I 




dCTP® 
dCTP^ 




ddOTpO 

dATpO 
dGTpO 
dCTP<> 



T>MA 




ddTTpO 
dTTP® 
dATpO 
dGTpO 
dCTpO 



ddTTP® 
dTTpO 
dATpO 
dGTpO 
dCTpO 




dTTpO 
dATpO 
dGTP<> 
dCTPO 



FIGURE 13. 



wo 97/37041 



14/43 



PCT/US97/D4394 




FIGUSE 14 
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FIGURE 16 
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FIGURE 17 
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FIGURE 18 
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FIGURE 20, CONT 
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FIGURE 23 
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FIGURE '24 
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FIGURE 25 



• . TCACCCCCACCMAATS 



WO97A37041 

27/43 



PCT/US97/04394 



FIGURE 26 
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FIGURE 27 
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FICURE 28 
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FIGURE 29 
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FIGURE 31 
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